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Preface 



The Workshop on Approximation and Online Algorithms (WAOA 2003) focused 
on the design and analysis of algorithms for online and computationally hard 
problems. Both kinds of problems have a large number of applications aris- 
ing from a variety of fields. The workshop also covered experimental research 
on approximation and online algorithms. WAOA 2003 took place in Budapest, 
Hungary, from September 16 to September 18. The workshop was part of the 
ALGO 2003 event, which also hosted ESA 2003, WABI 2003, and ATMOS 2003. 

Topics of interest for WAOA 2003 were: competitive analysis, inapproximabil- 
ity results, randomization techniques, approximation classes, scheduling, coloring 
and partitioning, cuts and connectivity, packing and covering, geometric prob- 
lems, network design, and applications to game theory and financial problems. 
In response to our call for papers we received 41 submissions. Each submission 
was reviewed by at least 3 referees, who judged the papers on originality, quality, 
and consistency with the topics of the conference. Based on these reviews the 
program committee selected 19 papers for presentation at the workshop and for 
publication in this proceedings. This volume contains the 19 selected papers and 
5 invited abstracts from an ARACNE minisymposium which took place as part 
of WAOA. 

We would like to thank all the authors who responded to the call for papers 
and the invited speakers who gave talks at the ARACNE minisymposium. We 
specially thank the local organizer Janos Csirik, the members of the program 
committee 

— Susanne Albers (University of Freiburg) 

— Evripidis Bampis (University of Evry) 

— Danny Chen (University of Notre Dame) 

— Amos Fiat (Tel Aviv University) 

— Rudolf Fleischer (Honk Kong University of Science and Technology) 

— Pino Persiano (University of Salerno) 

— Jose Rolim (University of Geneva) 

— Martin Skutella (Max-Planck-Institut fiir Informatik) 

and the subreferees 

— Aleksei Fishkin 

— Dimitris Fotakis 

— Hal Kierstead 

— Arie Koster 

— Michael Langberg 

— Christian Liebchen 

— Ulrich Meyer 

— Guido Schaefer 

— Guochuan Zhang 

— Uri Zwick 
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Online Coloring of Intervals with Bandwidth 
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Abstract. Motivated by resource allocation problems in communication net- 
works, we consider the problem of online interval coloring in the case where the 
intervals have weights in (0, 1] and the total weight of intersecting intervals with 
the same color must not exceed 1. We present an online algorithm for this prob- 
lem that achieves a constant competitive ratio. Our algorithm is a combination of 
an optimal online algorithm for coloring interval graphs and First-Fit coloring, for 
which we generalize the analysis of Kierstead to the case of non-unit bandwidth. 

1 Introduction 

Online coloring of intervals is a classical problem whose investigation has led to a num- 
ber of interesting insights into the power and limitations of online algorithms. Intervals 
are presented to the online algorithm in some externally specified order, and the algo- 
rithm must assign each interval a color that is different from the colors of all previously 
presented intervals intersecting the current interval. The goal is to use as few colors as 
possible. If the maximum clique size of the interval graph is to, it is clear that jj colors 
are necessary (and also sufficient in the offline case [6]). Kierstead and Trotter presented 
an online algorithm using at most 3 u — 2 colors and showed that this is best possible 
[11]. Another line of research aimed at analyzing the First-Fit algorithm, i.e., the algo- 
rithm assigning each interval the smallest available color. It is known that First-Fit may 
need at least 4.4 u colors [3] on some instances and is therefore not optimal. A linear 
upper bound of 40 u was first presented by Kierstead [8] and later improved to 25.8 uj 
by Kierstead and Qin [10]. 

In this paper, we study a generalization of the online interval coloring problem 
where each interval has a weight in (0, 1]. Motivated by applications, we refer to the 
weights as bandwidth requirements or simply bandwidths. A set of intervals can be as- 
signed the same color if for any point r on the real line, the sum of the bandwidths of its 
intervals containing r is at most 1. The special case where every interval has bandwidth 
1 corresponds to the original online interval coloring problem. Our problem is thus a 
simultaneous generalization of online interval coloring and online bin-packing. 

Our main result is an online algorithm that achieves a constant competitive ratio 
for online coloring of intervals with bandwidth requirements. The algorithm partitions 

* Partially supported by the Swiss National Science Foundation under Contract No. 21-63563.00 
(Project AAPCN) and the EU Thematic Network APPOL II (IST-2001-32007), with funding 
provided by the Swiss Federal Office for Education and Science. 

R. Solis-Oba and K. Jansen (Eds.): WAOA 2003. LNCS 2909. pp. 1-12, 2004. 

© Springer- Verlag Berlin Heidelberg 2004 




2 



Udo Adamy and Thomas Erlebach 



the intervals into two classes and applies First-Fit to one class and the algorithm by 
Kierstead and Trotter [1 1] to the other class. In order to analyze First-Fit in our context, 
we extend the analysis by Kierstead [8] to the setting where the intervals have arbitrary 
bandwidth requirements in (0, 1/2]. 



1.1 Applications 

Besides its theoretical interest, investigating the online coloring problem for intervals 
with bandwidth requirements is motivated by several applications. 

First, imagine a communication network with line topology. The bandwidth of each 
link is partitioned into channels, where each channel has capacity 1. The channels could 
be different wavelengths in an all-optical WDM (wavelength-division multiplexing) 
network or different fibers in an optical network supporting SDM (space-division multi- 
plexing), for example. Connection requests with bandwidth requirements arrive online, 
and each request must be assigned to a channel without exceeding the capacity of the 
channel on any of the links of the connection. We assume that the network nodes do 
not support switching of traffic from one channel to another channel (which is the case 
if only add-drop multiplexers are used). Then a connection request from a to b corre- 
sponds to an interval [a, b) with the respective bandwidth requirement, and the problem 
of minimizing the number of required channels to serve all requests is just our online 
coloring problem for intervals with bandwidth requirements. 

A related scenario in a line network is that the connection requests have unit dura- 
tions and the goal is to serve all connections in a schedule of minimum duration. In this 
case, the colors correspond to time slots, and the total number of colors corresponds to 
the schedule length. This is a special case of the call scheduling problem considered, 
for example, in [5,4], 

Finally, an interval could represent a time period during which a job must be pro- 
cessed, and the bandwidth of the interval could represent the fraction of a resource 
(machine) that the job needs during its execution. At any point in time, a machine can 
execute jobs whose bandwidths (here, resource requirements) sum up to at most 1. If 
jobs arrive online (before the actual schedule starts) and have to be assigned to a ma- 
chine immediately, with the goal of using as few machines as possible, we again obtain 
our online interval coloring problem with bandwidth requirements. 



1.2 Related Work 

We have already discussed previous work on online coloring of intervals (without band- 
width requirements) in the beginning of the introduction. A survey of results for online 
graph coloring can be found in [9]. The problem of assigning colors to paths that repre- 
sent connection requests in communication networks has been studied intensively due 
to its motivation by all-optical WDM networks. A survey of offline results can be found 
in [2], Online path coloring results for trees and meshes are given in [1]. In these path 
coloring problems, the bandwidth requirement of each path is 1, implying that no two 
intersecting paths can get the same color. 
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2 Preliminaries 

We are given a collection X of closed intervals over the real numbers R, where each 
interval i is associated with a bandwidth requirement b(i), where 0 < b(i) < 1. In an 
online scenario these intervals are processed one by one in some externally determined 
sequence. Whenever an interval i arrives, the algorithm knows the previously processed 
intervals and the colors assigned to them, but it knows nothing about the unprocessed 
intervals arriving in the future. Based on this knowledge the algorithm irrevocably as- 
signs a color /(*) to the interval i, in such a way that for every color x and every point 
r £ R the sum of the bandwidths of the intervals containing r that are assigned color x 
is at most 1. 

We compare the performance of an online algorithm with the number of colors used 
by an optimal offline algorithm, denoted by OPT. More precisely, we call an online 
algorithm A c-competitive if A(X) < c • OPT (I) for all input sequences X. 

Letl = {*i, . . . , i[\ be the collection of intervals and let b(it) denote the bandwidth 
requirement of interval i t , i.e. 0 < b(it) < 1 for t = 1,. See Fig. 1 for an example 
with i = 8 intervals depicted as rectangles. Their projections onto the real line are 
exactly the intervals over the real numbers, and the height of a rectangle shows the 
bandwidth requirement of the corresponding interval. Let ( X , <) be the partial order of 
the intervals in X, where the relation i < j holds if and only if the right endpoint of 
interval i is less than the left endpoint of interval j, e.g. we have < i 5 in Fig. 1. Let 
LCIbea subset of intervals. For an interval i £ X, we write i < L (L < i) if for all 
intervals j £ L, i < j ( j < i) holds. The neighborhood of an interval i G X is the set 
of intervals in X that are different from i and intersect i. It is denoted by N(i). In the 
example, the neighborhood of interval z 4 is N(k) = {k, *3}. 



*2 











h 


k 




*6 




*7 











1 1 1 1 1 1 1 1 1 1 ► R 

0 123456789 



Fig. 1 . The set X — {*i , . . . , ig} of intervals. 



The density D(r | L) of L at a point r £ R is the sum of the bandwidths of the 
intervals in L that contain the point r, i.e. D{r \ L ) = Y^rejeL^(j)- The density 
D{i | L) of an interval / G X with respect to L is the minimum density I)(r \ L) over 
all points r in the interval i, i.e. D(i \ L) = min{D(r | L) : r £ i}. With D(L) we 
denote the maximum density of L, that is D(L) = max{ D(r \ L) : r £ R}. In our 
example shown in Fig. 1, the density D( 3 | X) = 6(12) + ^(*3) + fe(*4). The density 



4 



Udo Adamy and Thomas Erlebach 



of the interval i 3 with respect to X is D(i 3 \ X) = b(i 2 ) + b(i 3 ) and it is achieved at 
the point 2 for example. Finally, the maximum density D(X ) = b(i 2 ) + b(i$) + b(ie ) 
is achieved at point 5. 

Further, we need some notation for classifying intervals in terms of their left to 
right order (X, <). A chain C is a sequence of intervals, where i < j holds for any 
two consecutive intervals i and j in this sequence C . Therewith, we define the height 
h(i | L) of an interval i with respect to L to be the length of the longest chain C in L 
such that C < i. Similarly, the depth d(i \ L) is the length of the longest chain C in 
L such that i < C. The centrality of an interval i with respect to L is then given by 
c(i | L ) = min {h(i \ L),d(i \ L)}. Looking again at the example presented in Fig. 1, 
the height of interval i 4 with respect to X is h (14 | X) = 1 since the longest chain 
consists only of the interval i\. The corresponding depth d(i 4 | X) equals 3 because 
of the chain i e < i-j < ig of length 3. Hence, the centrality c (*4 | X ) is given by 
min{l, 3} = 1. 

Notice that for a subset L C TV (j), if an interval i contains an endpoint of the 
interval j then c(i \ L) = 0, and if c(i \ L) > 1 then the interval i is contained in the 
interval j, i.e. i C j. 

For our analysis we need two lemmas regarding the existence of an interval with 
high density in a clique of intervals. The first lemma is a generalization of a lemma that 
appeared in [12, 7] for the case of interval graphs, i.e. for the setting where all intervals 
have unit bandwidth. 



Lemma 1. LetX be a collection of intervals. If L is a clique in the intersection graph 
ofX (meaning that any two intervals in L intersect), then there exists an interval i £ L 
such that 



D(i | L) > 



D{L) 

2 



Proof Let L = { 1 - t , . . . , // }. Since the intervals in L form a clique, there is a point 
r£R that is contained in every interval in L and we have 



t 

D(L) = max{Z?(r | L) : r £ R} = b(i m ). 

m= 1 

We build the sequence S of bandwidths of the intervals in L from left to right. 
Whenever an interval i m starts or ends, we append its bandwidth b(i m ) to the sequence 
S. Thus, the length of the sequence S is 2 1 since every bandwidth of intervals in L 
appears twice, once for the left endpoint and once for the right endpoint. Starting from 
the beginning of S, let S\ be the largest initial subsequence of elements in S obeying 
the condition that their sum is less than D(L)/ 2. Likewise, let S 2 be the largest subse- 
quence starting from the end of S (going backwards) under the condition that the sum 
of elements in S 2 is less than D(L)/2. 

Now, we claim that there exists an interval i whose bandwidth appears neither in 
the subsequence ,S'-| nor in the subsequence S 2 . This is the case, because otherwise 
all bandwidths of intervals would appear either in S] or in S 2 , which, in turn, means 
that their total sum would be at least D(L) contradicting the fact that each of the two 
subsequences sums up to less than D(L)/2. 
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Every such interval i has a density D(i \ L) > D(L)/2. This follows from the 
construction of the sequences ,Sj and S 2 and from the fact that in a left-to-right scan 
through a clique of intervals all left interval endpoints precede all right interval end- 
points. □ 

Lemma 2. Let L be a clique of inten’als each of which is assigned to a different color. 
If every interval in L has density at least p < 1 within its color class, then there exists 
an interval i £ L such that the density ofi with respect to the union of the present color 
classes is at least p ■ \L\/2. 

Proof. This lemma can be proven similar to Lemma 1 by building a sequence S from 
a left-to-right scan of the intervals in L, except that this time we append the value p 
whenever an interval in L starts or ends. But we will prove it by induction on the size 
of L. In the base cases \L\ = 1 and \L\ = 2 any interval in L has the desired property, 
since \L\/2 < 1. For the induction step, let j be the interval of L with the smallest 
left endpoint and let k be the interval of L with the largest right endpoint. Using the 
induction hypothesis, we choose an interval i £ L\ { j , k} such that the density of the 
interval i with respect to the union of color classes of intervals in L \ { j , k} is at least 
p ■ (|L|/2 — 1). Since the intervals j and k intersect, the interval i is contained in j U k, 
and because the intervals j and k have density p within their color classes, the density 
of the interval i with respect to the union of all present color classes is at least p ■ \L\/2. 

□ 

Finally, if N = (m, . . . , n t ) is a sequence of length t we denote the initial subse- 
quence of N of length k by N ^ = (ni, . . . , n^.). The result (rii, ... ,n t ,n) of appending 
n to the end of N is denoted by N o n. 

3 The Algorithm 

Our algorithm works with two sets C\ and C 2 of disjoint colors. Let 0 = (T 102 ■■■ be 
any sequence of intervals. Each er; is associated with a bandwidth b(oi), 0 < b(oi) < 1. 
Whenever an interval (Ji arrives, we process it according to its bandwidth b(oi). 

If b(oi) < 1/2, the algorithm assigns the interval Ui a color from the set Ci of 
colors using the First-Fit principle. The First-Fit assignment is made by coloring the 
interval n, with color a, where a is the smallest color in C-\ such that for every r £ Oi, 
the sum of the bandwidths of intervals previously colored a and containing r is at most 
1 - b(ai). 

Ifb(oi) > 1 /2, the algorithm assigns the interval Oi a color from the set C 2 using the 
optimal online interval coloring algorithm of Kierstead and Trotter [11]. Note that the 
intervals having bandwidth exceeding 1/2 have the same property as intervals without 
bandwidth requirement, namely, if two intervals intersect, they have to receive different 
colors in a proper coloring. 

We remark that the partition into two classes is indeed necessary in our approach, 
because First-Fit can be arbitrarily bad for intervals with bandwidths in (0, 1] as shown 
in Fig. 2. 

Theorem 1. The algorithm solves the online problem of coloring intervals with band- 
widths using at most 195 times as many colors as the optimal offline algorithm. 
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(a) The First-Fit algorithm requires many colors 

for the sequence *i , ii , . . . , ig . 











ii 


*3 


^5 


*7 


*9 



► R 

(b) The optimal offline algorithm needs 

only two colors. 



Fig. 2. For bandwidth requirements in (0, 1] the First-Fit algorithm can perform arbitrarily badly. 



For the proof of this theorem we need two lemmas which we will prove below. Fix 
a sequence a and let X be the set of all intervals appearing in a. Let X\ C X be the 
intervals with bandwidth at most 1/2, and let X 2 C X be the intervals with bandwidth 
exceeding 1/2. We haveZ = Xi UZ 2 . 

Lemma 3. If the number of colors that the algorithm uses from the set C\ is at least 2, 
then this number is less than 192 times the maximum density D(X i) of intervals in X\. 

Lemma 4. The number of colors from the set C 2 that are used by the algorithm is less 
than 3 times the number of colors used by an optimal offline algorithm for coloring the 
intervals in Z 2 . 

Proof, (of Theorem 1 ) Let OPT denote the number of colors used by an optimal offline 
algorithm. Clearly, this number is at least as large as the maximum density of the inter- 
vals, i.e. D(X) < OPT. With OPT(Zi) and OPT(Z 2 ) we denote the number of colors 
assigned by an the optimal offline algorithm to the intervals in X\ and X 2 . respectively. 
Since OPT(Zi) and OPT(Z 2 ) count the number of colors for restricted sets of intervals, 
OPT (Xi ) < OPT and OPT(Z 2 ) < OPT hold true. 

By Lemma 3, the algorithm colors the intervals in X\ using the First-Fit algorithm 
with less than 192 • OPT(Zi) many colors. This is true, because OPT('X| ) > D(X 1 ) and 
if the First-Fit algorithm needs just one color for coloring the intervals in X\, then also 
the optimal offline algorithm needs one color. 



Online Coloring of Intervals with Bandwidth 



7 



By Lemma 4, the algorithm colors the intervals in Z 2 using an optimal online algo- 
rithm for coloring interval graphs with less than 3 times as many colors as an optimal 
offline algorithm. 

Thus, the number C of colors used by the algorithm is 

C < 192 • OPT(Xi) + 3 • OPT(J 2 ) 

< 192 • OPT + 3 • OPT 
= 195 • OPT. 

Hence, the algorithm is 195-competitive. □ 

Let us start with the proof of Lemma 4. 

Proof, (of Lemma 4) The optimal online interval coloring algorithm uses at most 3 • 
w(X 2 ) — 2 colors for coloring the intervals in I 2 [11]. Since the bandwidth of the in- 
tervals in I 2 is greater than 1/2, any two intersecting intervals must receive different 
colors. Therefore any offline algorithm needs at least tu(X 2 ) many colors for coloring 
the intervals in I 2 . □ 

Now we will prove Lemma 3, using a rather technical induction. 

Proof, (of Lemma 3) 

In this proof we generalize a proof of Kierstead, who proved in [8] that First-Fit 
uses less than 40 u> colors for interval coloring, i.e. the case where all bandwidths are 1. 
We extend the proof to the case where all intervals have bandwidth at most 1/2. 

The main idea of the proof is the following: If the First-Fit coloring uses x > 192 • y 
colors, there is a point r C R where the density D(r \ Xf) exceeds y. For y > D(fL i) 
we would arrive at a contradiction, since — by definition of D(X i) — there will not be 
any point whose density with respect to X\ exceeds D(T\ ]. Therefore the number of 
colors used by the First-Fit algorithm is x < 192 • D(I-\ ]. 

Let /(?') denote the color that First-Fit assigns to interval i, and let C(q) = {i £ 
T : f(i) = q} be the set of intervals with color q. For a range of colors we denote by 
C[p, q] = U p< r <qC(r) all intervals whose colors are within that range. 

In order to find a point with high density in X - t , we construct a sequence I of in- 
tervals i\ D ■ ■ ■ D i t and disjoint blocks Bi , . . . , B t of intervals such that i m £ B m 
and the sum of densities D(i\ \ Bf) + ■ — b D(i m \ B m ) is at least some number 
s m , for nr = 1, . . . , t. In this construction, every block B m consists of intervals from 
the neighborhood of the interval i m _i whose colors are within a certain range. More 
formally, B m = C[x — 192 • r m + 1, x — 192 • r m _i] fl where r m < s m , for 

to = 1, ... ,t. Since the intervals in / form a decreasing sequence, the density of the 
interval i t is large, i.e. D(i t \ C[x — 192 - r t + 1, a:]) > s*. The numbers r rn and s m 
will be chosen in such a way that 192 • r m and 192 • s m are integral. 

In the induction step we would like to find an interval i t +i C it with high density 
in B t+ 1 . However, there may not be any interval with high density in B t + 1 , or the only 
intervals with high density in B t +i may overlap f. Before we explain below how to 
deal with these cases, we introduce some more notation. 

Let / = (*i, . . . , if) be a sequence of intervals and let N = (m, . . . , nf) be a 
sequence of real numbers, where each n m is of the form 2 k ~ 2 for some integer k > 0. 
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In particular, we have n m >1/4 for all m = 1, ... ,t. The following parameters are 
defined on the pair (TV, I) for 1 < m < t : 

- Let s m be the sum of the first m numbers of the sequence TV, i.e. s m = Yl'JLi n i> 

and set so = 0- Note that 4 • s m is integral, since all rij’s are of the form 2 fc_2 . 

- Let r ryi = s m _i + n m / 2 < s m , and set ?’o = 0. Observe that 8 • r m is integral for 

the same reason as above, 

- Let B m be the disjoint blocks of intervals defined by B m = C[x-192-r m + l,x — 

192 • r m - 1 ] IT for m > 1, where i o = R. B rn consists of all intervals in 

X\ that intersect the previous interval i m -i of the sequence I and have a color in 
the given range. 

- Let c m be the centrality, and d rn be the density of the interval i rn within the intervals 
bi i.e. Cm — c(i m \ ^m) and d m — D(i m \ Bm')- 

We call a pair (TV, I) admissible if the following conditions hold for 1 < m < t: 

( 1 ) The interval i rn is contained in the set B m , i.e. i rn £ B m - 

(2) The centrality of the interval i m with respect to B m is at least 2, i.e. c m > 2, if 
m > 1. 

(3) The number n m + 1 > n m 4 2_c + 1 , if m < t. 

( 4 ) The density of the interval i rn with respect to B m is at least n m , i.e. d m > n m - 

Notice that if (TV, I) is an admissible pair, then the density of the interval i t is at 
least St, i.e. D{i t \ T\) > st.. Since the blocks B m are disjoint, and the intervals i m are 
contained in each other by (2), the density of the interval i t is at least the sum of the 
densities d m , which in turn are at least n m by ( 4 ). By definition s t constitutes exactly 
this sum. 

We shall prove the lemma by first showing that there exists an admissible pair 
(TV, I), and if (TV, I) is such an admissible pair and the number x of used colors fulfills 
x > 192 • st then there exists an admissible pair (TV', I') such that s' t , > St- Here and 
below primes indicate elements and parameters of the new admissible pair. 

For the basis of the induction we need an admissible pair. Let i be any interval 
colored x. 

If b(i) = 1/2, then TV = (1/2) and I = (i) is an admissible pair consisting of 
t = 1 interval. Its parameters are as follows: n-\ and therefore si equal 1/2, and rq = 
ni/2 = 1/4. Since the interval i\ - i is colored x, it is contained in the block B\ = 
C[x — 48 + 1, x). Further its density d\ in B\ is at least 1/2 = m, since the interval 
i\ itself contributes a bandwidth b{i\) = 1/2. The conditions (2) and (3) do not play a 
role for t = 1. 

lfb(i) < 1/2, then because of First-Fit there is a point r £ i, where intervals colored 
x — 1 intersect, whose total bandwidth exceeds 1/2. Otherwise the interval i would have 
been colored x — 1. These intervals form a clique L and according to Lemma 1, there 
exists an interval j £ L with density D(j \ L) > 1/4. Then / = (j) and TV = (1/4) 
is an admissible pair. Again we have to check the conditions. The numbers ni and si 
are 1/4, whereas rq = n-i/2 = 1/8. Since the interval i\ = j is colored x — 1, it is 
contained in the block B\ = C[x — 24 + 1, x]. The condition ( 4 ) is fulfilled, because 
the density of the interval is cZi > 1/4 = n\. 
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Now fix an admissible pair (TV, I). We begin the process of constructing (TV', I'). 
Let B = C[x — 192 • s t + 1, x — 192 • rt] D N(i t ). 

The bandwidth of the interval i t is b(it) < 1/2. Since i t € B t , the color f(i t ) of 
the interval i t is larger than x — 192 • r t . According to First-Fit, there exists for each 
color a £ [x — 192 • St + 1, x — 192 • r t \ a point r a £ i t such that there are intervals 
with color a intersecting at r a , whose band widths sum up to more than 1/2. Otherwise 
the interval i t would have received a smaller color. Since s t — r t = n t / 2, there are at 
least 96 • n t places r a where intervals with total bandwidth greater than 1/2 intersect, 
and according to Lemma 1, we find at least 96 • nt intervals of different colors each with 
density exceeding 1/4 within its color class. Let M denote this set of intervals. 

We partition M into level sets according to the left-to-right ordering of the intervals. 
Let Lk = {i € M : h(i \ M) = k < d{i \ M)} and Rk = {i £ M : h(i \ M) > k = 
d{i | M)}. For an interval i € M, i G L k U Rk or i G M\ \Jj <k {Lj U Rj) implies that 
the centrality c(i \ M) of i with respect to M is at least k. 

By the pigeon hole principle (at least) one of the following four cases holds true. 
Otherwise M would contain fewer than 96 • n t intervals. 

(i) There is an interval i £ M with high centrality c(i \ M) > 3 + [~log 4 nt] ■ 

(ii) There is a large central level set, i.e. | L k \ > 8ntA 2 ~ k or \R k \ > 8ntA 2 ~ k for some 
k such that 2 < k < 2 + |~log 4 n*] . 

(iii) There is a large first level set, i.e. \L\] > 8 n* or |i?i| > 8 nt. 

(iv) There is an extra large outer level set, i.e. |Lq| > 28 nt, or |i?o| > 28 nt. 



- Case (i): 

If there is an interval i £ M with high centrality, we extend the sequence / by this 
interval i. The length of the sequence increases by 1. Hence, we set t' = t + 1, 
i' t+1 = i, and n' t+1 = 1/4, because the interval i' t+1 has density greater than 1/4 
within its color class. 

We have to show that N' = N o n’ t+l and r = i°i' t+ x form an admissible pair 
(TV' , I'). For condition (1) we have to observe that the interval i' t+l is chosen from 
B which is a subset of B' t+1 , because r' t+1 = s t + n' t+1 /2 > s t . Hence, i' t+1 £ 
B' t+V Condition (2) holds, because c' t+l > 3 + [log 4 nt . ] > 2 since nt > 1/4. The 
number n' t+l = 1/4 = 4 _1 = ?r t 4 2_3_loS4 ” > n t 4 2 ~ c + 1 fulhlls the condition 
(3), and the density d' t+1 > 1/4 = n' t+1 , which establishes condition (4). The 
density of the interval i' t , with respect to I\ is at least s' t , = s* + 1/4. 

- Case (ii): 

If there is a large central set, we can without loss of generality assume that \L k \ > 
8ntA 2 ~ k . The other case is exactly symmetric. Note that L k is a clique in X\ 
since i.j £ M and i < j implies that h(i \ M) < h(j \ M). Furthermore, 
each interval in L k has a density greater 1/4 within its color. Thus, by applying 
Lemma 2 with p = 1/4 we can find an interval i £ L k with density D(i \ B) > 
ntA 2 ~ k . We extend the sequence I by this interval i and set the values t' = t + 1, 
i' t _ |_i = i, and n' t+1 = ntA 2 ~ k ■ Note that n' t+1 > 1/4 since n' t+1 = nt4 2_fc > 
n t 4 2_2_ r iog 4« 1 > n t 4” log4n _1 = 1/4. 




10 



Udo Adamy and Thomas Erlebach 



Again, we have to verify that N' = N o n' t+1 and r = i°i' t+ 1 form an admissible 
pair (TV',/'). The condition (1) is satisfied, because the interval i' t+1 is chosen 
from B, which is a subset of B' t+1 , since r' t+1 = ,s t + n' t+1 /2 > s t . The centrality 
c' t+1 = k > 2 and the number n' t+1 = ntA 2 ~ k fulfill the conditions (2) and (3). 
The condition (4) concerning the density of the interval i' t+1 holds, because d' t+1 > 
n t A 2 ~ k = n’ t+1 . 

Hence, the density of the interval i' t , with respect to I\ is at least s' t , = st + ntA 2 ~ k . 

- Case (iii) 

If there is a large first level set, we can without loss of generality assume that 
L\ | > 8 rit. For the same reason as in Case (ii) the set L\ is a clique i n X \ , and each 
interval in L\ has a density greater 1/4 within its color class. Hence, by Lemma 2 
(again with p = 1/4) there exists an interval i £ Li with density D{i \ B) > n*. 
Since this interval i with high density in B' t+1 might have a centrality of one only, 
thus violating the condition (2), we replace the interval i t with the interval i. We 
set t' = t,i' t = i, and n! t = 2 n t . 

Then the new block B[ I) B t U B, since r' t = s t _i + n' t /2 = st- Also the interval 
i' t C it, because i' t has centrality 1 in the subset M of the neighborhood N(i t ). 
Hence, i' t £ B' t establishing the condition (1). The condition (2) simply holds, 
because we have c' t > c t > 2, if t > 1. For condition (3) a rough estimate suffices, 
namely n' t > n t > n t -\ 4 2_c > n’ t _ x 4 2_c . The density condition (4) is satisfied, 
because d' t > D(i \ B) + d t > 2 n t = n' t . 

In total, the density of the interval i' t , with respect to X\ is at least s' t , = s t + n t . 

- Case (iv) 

If there is an extra large outer level set, we can without loss of generality assume 
that | To | > 28rit. Each interval in Lq has density greater 1/4 within its color 
class. Let e be the leftmost point that is contained in the interval i t as well as in all 
intervals in L$. We define T := B t U B. Then, the density D(e \ T) of the point 
e with respect to T is at least the density of the interval it with respect to Bt plus 
the density of e within the set L 0 . Hence, D(e \ T) > D(i t \ B t ) + D{e \ L 0 ) > 
nt + 7 nt = 8 nt., and according to Lemma 1 we find an interval j\ in the clique of 
intervals in T containing e, such that D(j\ \ T ) > 4nt. 

Additionally, we choose an interval j 2 from B t such that its centrality c(j 2 Bt) = 
Ct — 1 and j 2 < it- Note that such an interval must exist, because of the centrality 
of the interval i t in B t . 

Now we distinguish two cases according to the left interval endpoints of j i and 
j 2 • Either the left endpoint of j-> lies further left, which implies that the centrality 
c(j i | T) > c(j 2 | T) > ct — 1, or the left endpoint of the interval j i lies further 
left, in which case the interval j 2 is contained in j-\ . and thus the density D(j 2 \ 
T) > D(j i | T) > An t . 

In either case there exists an interval i £ {ji , J 2 } in T such that D(i \ T) > An t 
and c(i \ T) > c t - 1. In order to fulfill the condition (2) we have to replace either 
the interval it or the interval i t - 1 with the new interval i depending on the centrality 

c(i | T). 
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• If the centrality c(i | T) > 2 or t = 1, we replace the interval i t with the new 
interval i, and set the values t' = t,i[ = i, and n' t = Ant.. 

The new block B[ D T, since r' t = St - i + n ' t / 2 = St - 1 + 2 n t = s t + n* > .s t . 
Because of the centrality of the interval i' t the condition (2) is clearly satisfied, 
and we have also that i' t C it- 1 , if t > 1. Hence, the interval i' t is contained 
in B' t fulfilling the condition (1). To verify the condition (3) we need a short 
calculation. We have n' t = Ant > 4nt_i4 2_c > n' t _ l 4 2_c , where the last 
inequality holds because c' t > c t - 1. The condition (4) holds because the 
density d! t > Ant — n' t . 

Hence, the density of the interval i' t , with respect to X\ is at leasts/ = s* + 3n*. 

• Otherwise c(i | T) = 1 and t > 1. Thus the centrality c t = 2 and nt > n t -i. 
In this case we discard the interval i t and replace the interval i t - 1 with the new 
interval i. We set the numbers t 1 = t — 1, i , t _ 1 = i and n' t _ 1 = 4n t . 

We are left to show that the sequences N' = Nt - 1 o ?■(/_! and /' = /t _2 o i' t _ 1 
form an admissible pair (N' , I'). 

For condition (1) observe that the new block B[_ 1 D B t -\ U T, since r’ t _ 1 = 
St — 2 + n' t _ 1 /2 = St — 2 + 2 n t . > s t - 2 + n t _i + n t = s t .. Because of the 
centrality of the interval i' t _ 1 we have i t - 1 D Thus, we can conclude 
that £ B't-i - Since the interval i' t _ 1 is contained in the interval it- 1 , the 
centrality d t -\ > O-i, which establishes the condition (2), because q_i > 2, 
if t > 2. The condition (3) holds, since n' t _ 1 = Ant > Tit - 1 > rit_24 2_c _1 > 
n' t _ 2 A 2 ^ c - 1 . Finally, the density condition (4) is fulfilled because <- 1 > 
D(i | T) + dt-i > A n t + n t _i > An t = n' t _ % > 

This time, the density of the interval i' t , with respect to 1\ is at least s' t , = 
St - 2 + Ant > s t - 2 + nt - 1 + n t + 2 n t = s t + 2 n t . □ 



4 Conclusion 

We have presented an online algorithm for coloring intervals with bandwidth require- 
ments that achieves a constant competitive ratio, where the constant is at most 195. 
This large constant is presumably not optimal, since already the analysis of the First-Fit 
algorithm for interval graphs by Kierstead [8] is not optimal. We extend this analysis 
to the case of bandwidth requirements in (0, 1/2]. At this time we do not know how 
to generalize the optimal online algorithm for interval graphs [11] to the case of arbi- 
trary bandwidths between 0 and 1/2. An improved algorithm or a better analysis of the 
First-Fit method would be interesting future research. 
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Abstract. In this paper the process of data transmission in optical 
communication networks is modeled as a shop-type scheduling prob- 
lem, where channels (wavelengths) are treated as machines. We formu- 
late an Open Block problem with the minimum makespan objective (an 
OB \ ICmax problem) in which a relation of a new type between the opera- 
tions of each job is introduced: any two operations of a job have identical 
processing times and may be processed either completely simultaneously 
(in a common block) or, alternatively, with full diversity in time. We 
show that the problem is polynomially solvable for 4 machines, binary 
NP-hard for 6 machines and strongly NP-hard for a variable number of 
machines. Adding release dates to the two-machine problem also leads 
to the NP-hardness in strong sense. For the case of a variable number of 
machines we present a polynomial time \/2-approximation algorithm and 
prove that there is no polynomial time p-approximation algorithm with 
p < 11/10, unless P=NP. For the case when the number of machines is 
fixed, we show that the problem can be solved by a linear time PTAS and 
by a few linear time statistically optimal algorithms (generating optimal 
schedules for almost all instances). 



1 Introduction 

In this paper we study a scheduling problem that models the process of data 
transmission in optical communication networks with n nodes and m channels. 
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We assume that each node has a receiver which is pre-tuned to a fixed channel, 
and has a “tunable” transmitter (laser) which can transmit on several channels 
simultaneously. Each node on the transmitting side has a multicast data packet 
that must be delivered to all receiving nodes in its destination set. The trans- 
mission of a data packet completes when each destination node receives at least 
one copy of the packet. The goal is to minimize the overall transmission time, 
i.e. , the time for all data packets to be received. We consider a multicast service 
with fanout splitting , which means that any data packet can be split into sev- 
eral packets, identical by their contents but having element-wise different sets of 
destination channels. (For details see [8], [14].) 

Data transfer in such optical networks must meet the following natural re- 
strictions: 

(a) at any point in time no two transmitters may transmit information on the 
same wavelength (channel); 

(b) at most one data packet can be sent by a transmitter at a time (although, on 
several channels simultaneously) . Below we formulate a new scheduling problem 
based on these conditions. 

Open Block Problem (short notation: OB||C max ). We are given a set of n 
jobs J = {Ji, . . . , J n } and a set of m machines M = {Mi, . . . , M m }. For each 
job Jj , a processing time pj and a subset of indices Vj C {1, . . . , m} are specified, 
so that job Jj consists of \vj\ operations that have to be processed on machines 
{Mi | i <E vj} and have identical processing times equal to pj. Any two operations 
of a job may run either simultaneously (in a block), or should not overlap in 
time. No restrictions on the combining of operations into blocks and on the 
order of processing the blocks are specified. Each machine can process at most 
one operation at a time. Preemption is not allowed. The goal is minimizing 
the makespan. (Fig. 1 illustrates a feasible schedule for a given instance of the 
OB-problem.) 




Index Sets 

Vi = { 1 , 3 } 

V 2 = {1} 

= { 1 , 2 , 3 } 
va = { 2 , 3 } 



Schedule 




Fig. 1 . A feasible open-block schedule for jobs Ji , . . . , J4 on three machines 



Thus, a novel feature of the above problem is a relation of a new type be- 
tween the operations: now any two operations of a job may be processed either 
completely simultaneously, or with a full diversity in time. 
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Related Problems and Results. Clearly, given an instance of the OB-problem, 
it can also be treated as an instance of the Multiprocessor Tasks problem (MPT) 

on one hand, and of the Open Shop problem — on the other hand. Moreover, 
any schedule constructed for that instance and feasible with respect to either 
the MPT, or the Open Shop problem is also feasible with respect to the OB- 
problem, and therefore, efficient methods elaborated for those two problems can 
be useful to provide good solutions for the OB-problem as well. 

Let us first review the known results for the 0||C max problem. As is known 
from [6], the 02\\C max problem admits a linear-time solution, while the 03\pij = 
Pj\C max problem is binary NP-hard. NP-hardness of the general O \ | C rnax prob- 
lem in strong sense follows from the result proved in [15]: deciding whether there 
exists a schedule of length at most 4 is strongly NP-complete. This also im- 
plies that for the 0||(7 max problem with a variable number of machines and any 
p < 5/4 there is no p-approximation algorithm, unless P = NP. At the same 
time, it is well known that any greedy algorithm provides a 2-approximation. 
For the Om| |C max problem (when the number of machines m is arbitrary but 
fixed), we can suggest a polynomial time approximation scheme (PTAS) [12]. 
Efficient statistically optimal algorithms able to construct optimal schedules for 
almost all instances of the 0||C max problem were designed in [3,9-11]. 

The MPT-problem seems to be much harder. Already MPT3\ |C max is strong- 
ly NP-hard [7]. However, MPTm\pj = l|C max can be solvable in polynomial 
time [7]. A PTAS is proposed for MPTm\\C max in [1]. On the other hand, 
M PT\pj = l|C max cannot be approximated within a factor of m 1 / 2-6 , unless 
P = NP [4]. Matching this, polynomial time 0(y / m)-approximation algorithms 
for M PT\pj = 1 ICmax and M PT\\C max were recently proposed in [2]. 

Our Results. It is shown that the problem can be solved in polynomial time, 
if the number of machines is at most 4, whereas the 6-maclrine problem is bi- 
nary NP-hard. The problem with a variable number of machines is shown to 
be strongly NP-hard, while adding different release times makes the problem 
strongly NP-hard already for two machines. The problem with a variable num- 
ber of machines admits a polynomial time \/2-approximation, but cannot be ap- 
proximated better than within a factor of 11/10 of the optimum, unless P = NP. 
In contrast to the above result, for the case of an arbitrary fixed number of 
machines it can be easily shown that the problem admits a linear time PTAS. 

As an easy corollary of results known for the Open Shop problem, we also 
establish that the problem becomes polynomially solvable for the set of instances 
with sufficiently large values of the maximum machine load — in comparison 
with the maximum job processing time. (It can be shown that on the set of 
instances with a fixed number of machines this property almost always holds for 
increasing number of jobs.) 

The paper is organized as follows. In Section 2 we introduce necessary no- 
tions and notation. In Section 3 a few known polynomially solvable cases of the 
OB-problem are presented. Section 4 contains NP-hardness and inapproxima- 
bility results. A few constant-factor approximation algorithms and a PTAS are 
presented in Section 5. 
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2 Preliminaries 

We define the maximum processing time of an operation as p max = max 7 - p,j , 
and the length dj of job Jj € J as the total length of all its operations; d max = 
maxj e j dj stands for the maximum job length over all jobs. The total processing 
time of the operations on machine Mi is denoted by £i and called the load of 
machine Mp, f max = max^j and C max (S) denote the maximum machine load 
and the length of schedule S respectively; OPT stands for the optimum. 

We say that a polynomial-time algorithm A is a p- approximation algorithm 
(or provides a p- approximation) for a minimization problem, if for any instance 
of the problem it outputs a feasible solution of cost at most p • OPT. We say 
that a minimization problem admits a polynomial-time approximation scheme 
(PTAS) if it can be solved by a polynomial-time (1 + ^-approximation algorithm 
for any fixed e > 0. 

It is clear that for any instance of the OB-problem (as well as of any schedul- 
ing problem on dedicated machines with the standard constraint that each ma- 
chine cannot process more than one operation at a time), 

f max < OPT. (1) 



Normal Schedules and Instances. Given an instance I of a scheduling prob- 
lem, its feasible schedule S is called normal , if 



c max (5) = e n 



(2) 



Due to (1), any normal schedule is optimal. The converse is not true, in 
general. An instance I is called normal , if its optimal schedule is normal. 

On the face of it, such a coincidence like in (2) seems to be improbable. 
However, it will be shown in section 3 that most instances of the OB-problem 
are normal, which approves the term. 

No-Idle Schedules. We say that a schedule is left no-idle ( right no-idle), or 
LNI ( RNI ), for short, if each machine M t £ M executes all its operations 
within a connected time interval = [tf, t'f] (i.e., without inner idle time), and 
all intervals A start (finish) at the same point in time t\ — t (t'f = t). 

Singles and Multi-jobs. A job Jj is called a single, if it consists of only one 
operation. Otherwise, it is called a multi-job. 



3 Polynomially Solvable Cases 

Theorem 1 . Every instance of the OB-problem with f machines and n jobs has 
a normal schedule which can be found in 0(n) time. 

Proof (sketch). We proceed as follows. First, we examine the list of jobs, 
Ji, Ji, ■ ■ ■ , J n , and glue any two jobs Jj and together each time, when their 
index sets coincide (vj = Vk), i.e., they have to be executed on the same sets of 
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machines. We terminate gluing as soon as all jobs have distinct index sets. This 
requires at most 0 (n) time. 

We obtain an instance containing 4 singles , 6 double-jobs , 4 triple-jobs , and a 
single multi-job with 4 operations. The latter can be removed from our instance 
without loss of generality: after constructing a normal schedule for the remaining 
instance, we can simply add the removed job as a single block at the end of the 
schedule. So, let now £ max stand for the maximum machine load in the remaining 
instance I*. 

Next, we aim at constructing a normal schedule. Let Ji (i = 1 , ... ,4) stand 
for the triple-job with the missed operation on machine Mi , and let Jjj denote the 
double-job with two operations on machines M,; and Mj : pi and pij will denote 
the processing times of those jobs, respectively. We define three numbers: 

ai2 = min {p 12, p 34 }, 043 = min{p 13 ,p 2 4 }, £*14 = min{p 14 ,p 2 3 }, 

and compute a* = max{ai2, 013, £*14}. Assuming w.l.o.g. that a* = a 12, we 
divide the set of all multi-jobs into two subsets: J' = { J12, J13, J24, J3 > J4} and 
J" — {J34, J14, J23, Ji, J2}. We construct a feasible LNI-schedule starting at 
time 0 for the jobs in J' and a feasible RNI-schedule finishing at time £ max 
for the jobs in J" . Then we put the remaining singles by using first-fit. For an 
illustration, see Fig. 2. Clearly, if we succeeded at all steps, the resulting schedule 
is normal. □ 
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Fig. 2. Scheme of the normal schedule S for 4 machines 



It is clear that given an instance of the OB-problem, we may consider it as 
an instance of the Open Shop problem and provide for it a feasible open shop 
schedule in which no two operations of the same job overlap. Such a schedule is 
also feasible for the original OB-problem, and when it is normal, it is evidently 
optimal for the OB-problem. 

In [10,9], a few sufficient conditions were derived which, given an instance 
of the Open Shop problem satisfying one of these conditions, guarantee that a 
normal schedule for that instance exists and can be found in polynomial time. 
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These results can be successfully applied to the Open Block problem, enabling 
one to construct optimal schedules for most instances in polynomial time. 

Theorem 2. Any instance of the OB\\C max problem with n jobs and m ma- 
chines satisfying at least one of the inequalities 

<max > ^ y m log 2 TO + y TO - 7 - 4 ^ P max? (3) 

Imax ^ nr p m axi (4) 

has a normal schedule exists which can be found in 0(min{nm + to 4 log 2 to, 
nTO 2 log??7.}) time in the case of (3), and in 0(min{nm + to 8 , n 2 m 2 }) time in 
the case of (f). □ 

Remark 1. One can observe from the above theorem that if all job processing 
times and the number of machines are bounded from above while the number 
of jobs infinitely increases and the mean job processing time is positive, then 
£ max tends to infinity and the conditions of Theorem 2 are satisfied with high 
probability. This implies that most instances of the OB-problem are normal and 
optimal schedules for such instances can be found by one of the polynomial time 
algorithms of Theorem 2. 

Another interesting polynomially solvable case of the OB-problem is for- 
mulated in Theorem 3. It is used in Section 5 in designing an approximation 
algorithm. 

Theorem 3. Every instance of the OB-problem in which the processing times of 
all jobs are integral degrees of 2 is normal. Its optimal (normal) schedule can be 
found for the problem with n jobs and m machines in 0{mn + n log n) time. □ 

The following LNIS-algoritlrm constructs the desired normal schedule. 

LNI Scheduling (LNIS): 

Step 1. Number the jobs in nonincreasing order of their processing times: 
Pi > P2 > ■ ■ ■ > Pn- 

Step 2. Considering the jobs in this order, schedule each operation of the 
current job as early as possible, without any idle time on the corresponding 
machine. 



One can describe a similar RNI Scheduling algorithm (RNIS) that constructs 
at Step 2 an RNI-sclredule finishing at point £ max (instead of the LNI-schedule 
being produced by the LNIS-algoritlrm). 

4 Hardness and Inapproximability Results 

First we formulate basic NP-complete problems which will be used in our NP- 
lrardness proofs. 
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PartitionM 

Instance: n positive integers e±,. . . ,e n € Z + and and an integer E £ Z + such 
that EILi e, = 9 E. 

Question: Is there a subset of indices N C {1, . . . , n } such that 

= 6 If? (5) 

i£N 

Monotone-Not-All-Equal-Different-4Sat (Modiff-4Sat) 

Instance: A set U of variables, a collection C of clauses over U such that each 
clause has size exactly 4 and contains only unnegated different variables. 
Question: Does there exist a truth assignment for U such that each clause in C 
has at least one true variable and at least one false variable? 

Theorem 4. The OH6||C max problem, is NP-hard. 

Proof. We prove that it is to NP-complete to verify whether a given instance of 
the 6-machine OB-problem has a normal schedule. To this end we construct a 
reduction from the PartitionM problem to our OB-problem. Given an instance 
{ei, . . . , e n ; E} of the PartitionM problem satisfying JA e, = 9 E, we define 
an instance I* of the OB-problem as follows. 

We introduce 6 machines Mi, . . . , Mq and 8 basic jobs: big jobs 61,62,63, 
medium jobs mi, m 2 , and small jobs si,s 2 ,s 3 with processing times: 

P (6i) = 70, p (6 2 ) = 73, p (63) = 67, p (mi) = 17, p(m 2 ) = 20, 

p(si) = 13, p(s 2 ) = 10, p(s 3 ) = 7. 

The machines are prescribed to execute the following jobs: 

Mi: {6i,mi,si}, M 3 : { b 2 ,m 1 ,s 2 }, M b : {6 3 ,to 2 ,Si}, 

M 2 : {6i ,m 2 ,s 2 }, M 4 : {6 2 ,to 2 ,s 3 }, M 6 : {6 3 ,m 1 ,s 3 }. 

Note that the load I, of each machine M, (i = 1, . . . , 5) is equal to 100, while 
the current load of Mq is 91. In addition to operations of basic jobs, machine Mq 
is prescribed to execute n tiny singles {ti , . . . , t n }. Each job t, has length ei/E. 
Thus, the total load of machine M (i is also 100, and so we have £ max = 100. 

To prove the theorem, it suffices to show that the question in the Parti- 
tionM problem has the positive answer, if and only if the instance I* has a 
normal schedule. 

•4= Assume that there exists a normal schedule S. Since p ( bj ) > f max /2, the 
operations of each big job bj must be executed as a block. Since no two different 
jobs have equal processing times, 61 and b 2 cannot be executed in the middle of 
the makespan [0,£ max ]. So the blocks of both jobs can only be scheduled either 
at the beginning, or at the end of the makespan. W.l.o.g., we may assume that 
61 is processed first on Mi and M 2 . 
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We first prove that b 2 must be the last job on M3 and M4. Assume the con- 
trary. Then 62 must be the first job on M3 and M4. Since p (mi) > {p (s\),p (S2)} 
and p(bi) ^ p{b 2 ), job mi has to be the last on both Mi and M 3 . Thus, S2 is 
in the middle on M 3 , and so it must be the last on M 2 , while m2 occurs there 
in the middle. But in this case we cannot place job m2 on M4 properly. This 
proves our claim. 

We now consider three possible locations of job 63 on machine M%. 

Case 1: b 3 is the third on Me (see Fig. 3). Then it is also the third on M5. 
Since 2 p(m 2 ) + ^(63) > Lax and p(s 1) ^ p(s3), job m2 can be the second 
neither on M4, nor on M 5 . Therefore, it must be the first on both machines 
whereas s 3 on M 4 must be the second. 
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M, I b, 
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m 2 


m 2 bj 
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1 1 * 1 
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M 6 \ s 3 \ m, 


b 3 
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Fig. 3. Normal schedules for the basic jobs in Case 1 



If mi is the first on M 3 (Fig. 3A), then due to the inequality 2 p {m\)+p (63) > 
Lax) it also must be the first on M 6 . Therefore, S3 must be scheduled on M 6 
exactly in the interval [20, 27], so as to form a block with its operation on M4. 
This implies the existence of a subset N satisfying (5). 

If mi is the second on M 3 (Fig. 3B), it must constitute a block (scheduled in 
the interval [10,27]) with its operation on M$. The tiny jobs scheduled on Mq 
between jobs m\ and 63 evidently determine the desired subset N satisfying (5). 

Case 2: 63 is the second on Me (see Fig. 4). Then it is also the second on 
M5. If m2 is the first on M5, then, due to p[rn\) > p(si), job mi must be the 
first on Me (see Figure 4A). Symmetrically, if m2 is the third on M5, then m\ is 
the third on M 6 as well (see Figure 4B). In both cases the existence of a subset 
N satisfying (5) is evident. 

Case 3: b 3 is the first on M 6 (see Figure 5). Then it is also the first on M 5 . 

Clearly, m2 must be the third on M2 and M5, and therefore, Si is the second 
on M5. This implies that si can only be the third on Mi whereas mi is the 
second. This, in turn, implies that mi is the second job on Mg constituting 
a block with its operation on Mi. This means that it must be scheduled in 
the interval [70,87], leaving for job S3 the only opportunity to be scheduled 
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Fig. 4. Normal schedules for the basic jobs in Case 2 
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Fig. 5. A normal schedule for the basic jobs in Case 3 



somewhere in the interval [87, 100]. Thus, the total length of tiny jobs processed 
on Mq after m\ is equal to 6, which implies the answer “yes” to the question of 
the PartitionM problem. 

=k Suppose that the answer to the question in the PartitionM problem 
is positive. Then a normal schedule for the instance /* can easily be produced. 
Indeed, any of the 5 schedules depicted in Figures 3-5 can be used for this 
purpose. □ 

A proof of the following problem will appear in a journal version. 

Theorem 5. The OB2\rj\C max problem is NP-hard in strong sense. □ 

Remark 2. By a similar argument it can be shown that it is strongly NP- 
complete to establish the existence of a feasible schedule for a given instance 
of the two machine OB-problem with deadlines. 

Now we turn to the following decision problem: given an instance of the 
OB-problem, does there exist a schedule of length at most 10? 

Theorem 6. The problem of deciding if there is a feasible OB-schedule of length 
at most 10 is NP-complete. 

Proof (sketch). To prove that the decision problem is NP-complete, we con- 
struct a reduction from the Modiff-4Sat problem. The technique used in our 



22 



Alexander A. Ageev et al. 



reduction differs from the one used in Williamson et. al. [15] in the following: 
instead of associating jobs and machines with literals (which obliges one to syn- 
chronize the jobs corresponding to literals of the same variable), we associate 
them with variables. □ 

As a straightforward corollary of the above theorem, we obtain 

Theorem 7. For the OB-problem with a variable number of machines and for 
any p < 11/10, there is no polynomial time p- approximation algorithm, unless 
P = NP. □ 

Corollary 1. For the OB-problem with a variable number of machines, there is 
no PTAS, unless P = NP. □ 

Now we should explain why in Theorem 6 we consider the time interval of 
length 10. (Clearly, if we could prove a similar result for a smaller value of t max , 
we would obtain a better lower bound on non-approximability.) The answer is 
contained in 

Theorem 8. There is a polynomial time algorithm that given an instance of 
the OB-problem with £ max < 9, constructs a normal (and therefore optimal) 
schedule. 

Proof. We show this for f max = 9. Other cases are easy and can be proved in a 
similar way. In fact, it suffices for each i G {2, . . . , 9} to prescribe a set of allowed 
locations for i-jobs within the interval [0,9], so that: 

(a) no two locations of i-jobs overlap in time; 

(b) for any feasible combination of jobs destined to be processed on one machine 

(i.e., such that their total length is at most 9), it should be shown that there exists 
a schedule for that machine in which each +job is assigned to one of its allowed 
locations. Define a set Li of allowed locations for each i £ (2, . . . , 9} as follows: 
L 9 = {[0,9]}, L 8 = {[1,9]}, L r = {[2,9]}, L & = {[3,9]}, L 5 = {[4,9]}, L 4 = 
{[0,4], [4,8]}, L 3 = {[0,3], [3, 6], [6, 9]}, L 2 = {[0, 2], [2, 4], [4, 6], [6, 8]} (clearly, 
1-jobs may be scheduled arbitrarily). It can be easily checked that, given a family 
of jobs { Ji, . . . , Jk} with total length < 9 containing a “long job” (of length > 5), 
it can be scheduled without overlapping within the time interval [0,9], so that 
each job gets one of its allowed locations. The remaining combinations of jobs 
with total length 9 can be sequenced as follows: 4 + 4 + 1, 4 + 2 + 3, 4 + 2 + 2 + 
1, 3 + 3 + 3, 2 + 1 + 3 + 3, 2 + 2 + 2 + 3, 2 + 2 + 2 + 2 + 1. □ 

5 Approximation Algorithms 

In this section we present a \/2-approximation algorithm. We present also a 
linear time PTAS for the case of an arbitrary fixed number of machines. 

First, we describe the First gluing procedure. The First gluing procedure 
combines all jobs in each u-class into a single job, with v as the machine set 
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and p(v) = 'fjj as the processing time of the new job. Using a bal- 

anced binary tree T with at most 2 m nodes (corresponding to classes J{ v), v C 
{1 , . . . , to}) and with the length of each path from the root to a leaf no greater 
than Ojm) we can prove that the First gluing procedure can be implemented 
in 0(m 2 n) time and completes with at most n! = min{n, 2 m } aggregated jobs 
having pairwise different machine sets. 

Squared-Round-and-Divide Scheduling (SRDS): 

Step 1. Applying the First gluing procedure, we get an instance with n! < 
min{n, 2 m j aggregated jobs { Jy} with processing times pj. As we noted, this 
can be done in 0(m 2 n) time. 

Step 2. Round the squared processing time p 2 of each aggregated job Jj up to 
the nearest value 2 a for an integer cty. Divide the whole set J of aggregated 
jobs into two subsets: J 0 dd (the set of jobs with odd exponents ay) and J eve n 
(with even exponents ay). Define instance J' dd by jobs { Jy € J 0 dd} with new 
processing times pj = 2 < - a -1 )/ 2 and instance J" ven by jobs { Jy £ J e ven} with 
new processing times pj' = 2 a ! 2 . Compute the loads tj and £" of each machine 
Mi in the instances J' dd and J" ve n . 

Step 3. Applying the LNIS-algorithm, construct the LNI-schedule S'} for the 
instance J' dd . 

Step 4. Retaining the job order on each machine and the block order for each 
job as in schedule S[ , find the earliest possible schedule Si for the jobs J 0 dd 
with the original processing times. Compute the completion time C) of each 
machine M* in this schedule. 

Step 5. Compute the parameter T = ma xjj gjvr(Cj + 

Step 6. Applying the RNIS-algorithm, construct the RNI-schedule S^T) (fin- 
ishing at time T) for the instance J" ven . Take the same starting times for the 
jobs Jj £ Jeven with the original processing times. 

Theorem 9. For any instance of the OB-problem, there exists a feasible sched- 
ule S with length C max (S) < \[2 f max ; the schedule can be found by the SRDS- 
algorithm in 0(m 2 ri) time, where m is the number of machines and n is the 
number of jobs. 

Proof. It is clear that all processing times {pj} and {pj 1 } in the instances 
J'dd and Jeven are integer degrees of 2, and hence (by Theorem 3), the LNIS 
and RNIS algorithms produce feasible no-idle schedules 5} and S^iT) for the 
instances I' odd and J" ven respectively. When we return to the original processing 
times, we obtain a schedule Si for the jobs in J odd ; it is feasible, because we keep 
the blocks of each job and their mutual order unchanged. For the jobs in J even we 
retain the same schedule 62 (T) (i.e., the same starting times of all operations). 
Therefore, it also remains feasible and keeps the property that on each machine 
Mj jobs in J even do not overlap with jobs in J 0 dd- Thus, the resulting schedule 
for the jobs with original processing times is completely feasible. To complete 
the proof of the theorem, it remains to show that T < \plii for each machine 
Mi. 
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Since T = C,; + £'( for some M t £ M , it suffices to show that for each 
machine M* we have C) < \FH' i and £" < y/2 £" , where l\ and I" stand for the 
loads of machine Mi on the sets of jobs J Q dd and J eve n in terms of their original 
processing times. While the second inequality is a simple consequence of our 
rounding technique, the first one needs a less trivial argument. 

The desired inequality follows from two properties: that £[ < £\ (which is 
clear), and that the completion time of each operation in schedule Si exceeds 
that in schedule S[ by a factor less than y/2, which can be proved by the induction 
on the job number and the block number (assuming that the jobs are numbered 
in the order of their consideration by the LNIS-algorithm, i.e., in non- increasing 
order of processing times pj, and the blocks of each job are numbered according 
to their starting times in schedule S[). 

Indeed, suppose that the above property holds for jobs Jk-i and for 

earlier blocks of job </*,, and we are scheduling the next block ( B ) of job </*,. Let 
the block consist of operations Oi, ... ,O s that are to be scheduled on machines 
Mi , ... , M s respectively (for simplicity of notation) . Since the starting time of 
block B is chosen to be as small as possible, it coincides with the maximum of 
the completion times of machines Mi, . . . , M s on the set of jobs {Ji, . . . , Jk-i} 
and the completion time of the previous block of job J*, (if any). In any case, 
it coincides with the completion time of some previously scheduled operation, 
and by the induction hypothesis, is at most \/2 times the completion time of 
schedule S[ . Since the length of block B is also less than a factor of \/2 of the 
length of that block for the rounded instance I ' , the required property follows. 

To complete the proof of the theorem, it remains to estimate the running 
time of the algorithm. It is clear that (not counting step 1) most costly steps of 
the algorithm are steps 4 to 6, where we have to apply the ordering procedure. 
Since the latter requires time 0(n' log n') and n! < min{n, 2 m }, this reduces to 
0(nm ) time, and so, the overall running time of the algorithm coincides with 
that of step 1 (namely, 0{nm 2 )). □ 

By Theorem 6 no PTAS can be designed for the OB-problem (unless P = NP) 
in the case when the number of machines is treated as a variable. An opposite 
result can be derived, if we treat the number of machines as a constant. In this 
case, a linear time PTAS exists similar to that proposed by Sevastianov and 
Woeginger [13] for the Open Shop problem. 

PTAS for the OB-Problem with a Fixed Number of Machines. For 

any fixed £ > 0, we present an algorithm A e that outputs a schedule S with 
makespan C max {S) < (1 + £) OPT. The desired (1 + ^-approximation can be 
easily attained in the case that d max < £ ■ f? max : a simple greedy algorithm [13] 
finds a schedule S with makespan 

drnax (S)< ^max H - ^max — (1 + e)£ 

max < (1 + e)OPT 

in time 0(m 2 n). The running time can be reduced to 0(nm + C(m,£)), if we 
first apply the following Second gluing procedure with d = e ■ £ max . Given a 
lower threshold d on lengths of jobs, we scan the list of jobs, gluing two jobs 
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together each time that the maximum of their lengths is no greater than d/2. 
The procedure terminates, as soon as each job in the resulting instance (but 
maybe one job) has length greater than d/2. 

In the case d max > e ■ £ max we divide the set of jobs J into three subsets 
L, M, S of large, medium , and small jobs, respectively: L = {Jj € J \dj > 
O ^max}? M — {J j C J | (X Cax — dj OL fmax}> S — { Jj C J \dj < 
a"t m ax}- The values of a' and a " are chosen with respect to the value of e 
so that: 

(a) the number \L\ of large jobs was bounded above by a constant; 

(b) the total length of medium jobs was at most e • £ max ; 

(c) the ratio a" /a' was small enough, so as to meet a! + a" + a"\L\ < e. 

As proved in [13], the numbers a' and a" with the above properties exist, 
the value of a" being at least a* = £/(e 125 • 2 m / £ ). The latter bound enables 
us to reduce the total number of jobs to at most a constant number 2 m/a* by 
applying the Second gluing procedure with the threshold d = a*£ max . Thus, we 
come to the following algorithm A e in the case d m ax > £ ■ I'max- 

Algorithm A e (in the case d max > e • £ m ax )’- 

Step 1. Apply the Second gluing procedure with d = e ■ e^ 1 ' 25 • 2 -m / £ • t max . 
Number the jobs with dj > d in nonincreasing order of dj . 

Step 2. Find a partition of the job set into subsets L, M, and S satisfying 
(a)-(c). 

Step 3. Construct an optimal schedule OPT(L) for the jobs in L. 

Step 4. Use the greedy algorithm [13] to place the jobs of M and S into 
OPT(L). 

Similar to [13], the following result can be proved. 

Theorem 10. For any e > 0 and any instance of the OB\\C max problem with 
n jobs and m machines, algorithm A s constructs a schedule S with makespan 

C max (S) < (1 + e)OPT. 

The algorithm runs in time 0(nm + C{m,e)), where is the time needed 

to find an optimal schedule for at most 0{2 m / e ) so-called “large” jobs. 
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Abstract. In a paper of Borodin, Nielsen and Rackoff [8], a framework 
for abstracting the properties of deterministic greedy-like algorithms was 
proposed. We extend their model so as to formally define “randomized 
greedy-like algorithms” and be able to prove lower bounds on the ap- 
proximability of a certain problem by such a class of algorithms. We 
show how our techniques can be applied in well-studied problems such 
as the facility location and makespan scheduling problems, for which 
both upper and lower bounds on the approximation ratio achieved by 
deterministic greedy-like algorithms are known. 



1 Introduction 

Borodin Nielsen and Rackoff [8] presented a framework for abstracting the main 
properties shared by several “greedy and greedy-like” deterministic algorithms. 
Within the proposed framework, they showed how to derive lower bounds on the 
approximability of various scheduling problems by such a class of algorithms. In 
this paper we extend the “priority algorithm” paradigm in [8] so as to define 
randomized greedy and greedy- like algorithms. More precisely, we show how 
to use the Von Neumann/Yao principle in the context of priority algorithms 
to prove lower bounds for algorithms that behave in a greedy-like fashion but 
also allow randomization. We apply the technique to two well-studied NP-hard 
problems: the problem of makespan scheduling, and the problem of metric facility 
location. 

According to [8] , deterministic greedy-like algorithms are classified as (deter- 
ministic) priority algorithms, namely algorithms with the following two proper- 
ties: i) The algorithm selects an ordering of the “input items” , and considers each 
input item in the determined order; and ii) As each input item is considered, the 
algorithm makes an “irrevocable decision” concerning the input item. Here, the 
decision on how to represent an input (i.e. , what the input items are) and the 
precise meaning of an irrevocable decision are pertinent to specific problems. 

Depending on whether the ordering of input items can change throughout 
the execution of the algorithm, we define two distinct classes of priority algo- 
rithms. Algorithms in the class Fixed Priority commit to an ordering prior 
to considering any input item, and the ordering cannot change in any subse- 
quent iteration. On the other hand, algorithms in the more general Adaptive 
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Priority class are allowed to specify a new ordering after each input item is 
considered and processed. Following the definition of [8], a (fixed or adaptive) 
priority algorithm belongs in the class Greedy if the irrevocable decision obeys 
the rule that the objective function is locally optimized ; specifically, the objective 
function must be optimized as if the currently considered input item were the 
last one. 

There are two natural ways to introduce randomization in a priority algo- 
rithm. First, the algorithm may choose a random ordering of the input items. 
Second, the algorithm can make random decisions for each input item 1 . In the 
case where randomization can be applied in both orderings and decisions we 
refer to such algorithms as fully randomized algorithms. 

In the online world it is a well-known fact that randomization can be helpful 
in improving the competitiveness of algorithms. As an illustrative example, we 
mention the randomized algorithm of Bartal et al. [4] for makespan scheduling 
on two machines which achieves the best possible competitive ratio, namely | 
(in contrast there is a | lower bound on the competitive ratio of every determin- 
istic online algorithm). Likewise, there is evidence that randomization may be 
useful in improving the approximation ratio of priority algorithms. For instance, 
Borodin et al. [8] presented a “classify and randomly select” priority algorithm 
for the problem of interval scheduling, and showed that it is a O(logZi) ap- 
proximation (here A is the ratio of the maximum profit per unit size over the 
minimum profit per unit size amongst all jobs in the input). In contrast, they 
showed that, for interval scheduling in a single machine, no deterministic priority 
algorithm can achieve better than a A approximation. 

Our main goal is to show that it is possible to derive interesting lower bounds 
on the approximability of optimization problems by randomized priority algo- 
rithms 2 . While the technique is general enough to address full randomization, it 
can also be applied to algorithms with more restricted access to randomness, i.e., 
randomized orderings or randomized decisions only. We apply the framework to 
the following well-known problems: 

1. The metric facility location problem, (a) For the model in which 
the input is represented as a set of facilities, we show that no fully-randomized 
priority algorithm is better than a | approximation. For randomized-ordering 
Fixed Priority Greedy algorithms, we prove a 1.5 lower bound on the ap- 
proximation ratio. (b) For the model in which the input is described as a set 
of cities, we show that no fully-randomized priority algorithm is better than a 
2-approximation. 

2. The makespan scheduling problem on identical machines. We show that 
no fully-randomized priority algorithm achieves an approximation ratio better 
than For Fixed Priority algorithms with randomized decisions, and when 
there are at last three machines, we show a corresponding lower bound of ^ . 

1 In particular, every randomized online algorithm can be classified as a Fixed Pri- 
ority, random- decisions algorithm. 

2 Due to space limitations we omit proofs of certain theorems and lemmas in this 
extended abstract. Full proofs will appear in the journal version of the paper. 
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We emphasize that, as in deterministic priority algorithms (and similar to 
competitive analysis in the online world) the lower bounds we derive are orthog- 
onal to any complexity considerations. In other words, we allow the algorithm 
unbounded time complexity. 

As one may expect, the criterion of what precisely constitutes a greedy- like 
algorithm can be very subjective. Hence one can argue that there exist algo- 
rithms which could intuitively be characterized as (deterministic or randomized) 
greedy algorithms, and which do not fit in the priority algorithm framework, 
e.g., greedy-like algorithms that apply iterative refinements of partial solutions, 
or greedy-like algorithms which can make temporary decisions about each input 
item, i.e. , decisions that can be revoked in future iterations (an example of an 
algorithm in the latter class is the one-pass Admission algorithm of Bar-Noy, 
Gulra, Naor and Schieber [3]). However, we claim that the priority-algorithm 
model still captures a rich class of (deterministic or randomized) algorithms 
that can be classified as greedy- like. To support the claim, Section 3 overviews 
some upper bounds for the problems we consider, and which are achieved by 
priority algorithms. 

Clearly, this line of work is motivated by the fact that greedy algorithms 
are a standard tool in both theory and practice, and thus it is useful to know 
in which cases the greedy approach will not perform efficiently. There are two 
more reasons we believe it is interesting to focus on the approximation power 
of the broad class of priority algorithms (as opposed to only considering a few 
“natural” greedy algorithms for every specific problem). First, it is possible that 
the intuition behind the lower-bound constructions could provide insight into 
how to design better randomized priority algorithms. Similar insight has proven 
quite useful in the design of online algorithms. For a concrete example, see [28] 
where the proof that no randomized online algorithm for 2-maclrine scheduling is 
better than |-competitive suggests how to design a randomized algorithm which 
is ^-competitive (a result due to Bartal et al. [4]). 

Second, it is not always the case that the best approximation is achieved by 
natural greedy-like (deterministic or randomized) algorithms. Leaving aside the 
debatable issue of characterizing a natural greedy algorithm (arguably, a very 
subjective task), there exist problems where good approximations are achieved 
by greedy-like algorithms which, if not “unnatural” , certainly are not straightfor- 
ward. For instance, the natural greedy algorithm of Mahdian et al. [21] for metric 
facility location does not achieve as good an approximation ratio as the priority 
algorithm of Jain, Mahdian and Saberi [20], which is described as a primal-dual 
algorithm and does not yet have a clear combinatorial statement. A similar situ- 
ation has been observed in online computation: for instance, some of the recent 
algorithms for makespan scheduling (see, e.g., [1]), as well as algorithms for load 
balancing in related and unrelated machines (see [6] for a detailed exposition) 
are definitely not among the most intuitive approaches one could come up with. 
Since every online algorithm is also a Fixed Priority algorithm, it is reason- 
able to believe that not every greedy-like algorithm is expected to have a natural 
statement. 
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2 Preliminaries 

Deterministic vs. Randomized Priority Algorithms and the Minimax 
Principle. Informally, we can define a randomized priority algorithm as a prob- 
ability distribution over the set of all deterministic priority algorithms of the 
same class. To make the definition more precise, we first provide a more formal 
description of deterministic priority algorithms. Consider a specific optimization 
problem 77; one can associate with 77 a set S of input items. For instance, 
for the problem of makespan scheduling, S is the (infinite) set of all possible 
jobs, each characterized by its size and its id. For every input item s € 5, de- 
note by D(s) the set of allowable decisions associated with s; e.g., in makespan 
scheduling with m machines, D(s) can be described by the set {1, . . . , to}, where 
decision i can be interpreted as “schedule input item s on machine i”. 

A deterministic priority algorithm for 77 is specified by two functions: the 
ordering function and the decision function. The ordering function is simply 
a function that produces an ordering of the input items in S. As observed in [8], 
one has to restrict the ability of the algorithm to produce an ordering, otherwise 
the algorithm can choose an ordering that can give rise to an optimal solution. 
The nature of the adversary described in [8] provides an (implicit) restriction on 
the types of ordering functions that are allowed. 

On the other hand, the decision function is a function which, given the cur- 
rent input item s £ 5, and the history of the algorithm’s execution, maps s 
to a decision in D (s). Here, the term history refers to all inputs the algorithm 
has considered in the past (and implicitly, to the algorithm’s corresponding de- 
cisions). 

A priority algorithm A for a cost-minimization problem is a c-approximation 
algorithm 3 if 

A(a) < c - OPT(a), (1) 

for every input cr, where A(cr), OPT(cr) denote the cost of algorithm A and the 
optimal cost on input cr, respectively. The approximation ratio of A is defined 
as the infimum of the set of values c for which A is a c-approximation. 

A randomized priority algorithm is a priority algorithm in which either the 
ordering function or the decision function, or both functions are randomized; 
we refer to such algorithms as randomized on the orderings, randomized on the 
decisions, or fully-randomized, respectively. For a randomized priority algorithm, 
the definition of a c-approximation is as in (1), with A(a) replaced by its expected 
value over the random choices of A, denoted by E[A(cr)]. 

As shown by Borodin et al. [8], one can derive lower bounds for determin- 
istic priority algorithms by evaluating the performance of every such algorithm 

3 Alternatively, A is an asymptotic c-approximation algorithm if A(o) < c- OPT(o) + 
o(OPT(o)). This is a weaker definition of approximability, since it ignores the (some- 
times pathological) performance of the algorithm on small input sets. The Von Neu- 
mann/Yao principle is still applicable, however one has to supplement (2) with ad- 
ditional constraints, similar to the argument of Chrobak et al. [10] in the context of 
competitive analysis of online algorithms (details will be given in the full paper). 
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on an appropriately constructed nemesis input. But how can we show bounds on 
the approximability of a problem by a randomized priority algorithm? First of 
all, we will assume that the adversary against a randomized priority algorithm 
has knowledge of the algorithm, but does not have any access to the random 
choices the algorithm makes in the course of the game. Borrowing the termi- 
nology of competitive analysis, we refer to such an adversary as the oblivious 
adversary. Under this assumption, one can resort to the Von Neumann/Yao 
principle to prove lower bounds on the approximation ratio of randomized prior- 
ity algorithms against such an adversary, similar to the application of the same 
principle in the analysis of online algorithms (as introduced by Borodin, Linial 
and Saks [7]). More specifically, suppose that there exists a constant c such that 
we can present a distribution T> over inputs cr such that 

Ed [A{a)\ > c ■ Ed [OPT {a)} , (2) 

for every deterministic priority algorithm A in the class C. Then the cost- 
minimization problem cannot be approximated by any randomized priority al- 
gorithm in the class C within a ratio better than c. 

Naturally, the crux in the application of the Von Neumann/Yao principle lies 
in the selection of the appropriate input distribution. In the case of Adaptive 
Priority algorithms, as well as fully-randomized Fixed Priority algorithms, 
one has to resort to ad hoc methods for identifying the desired distribution. A 
special case arises when one considers Fixed Priority algorithms with ran- 
domization on the decisions only , in the sense that one can have better insight 
on how to identify a good distribution. More precisely, the distribution can be 
determined by a game between a deterministic Fixed Priority algorithm and 
the adversary, as follows: the adversary presents a set of potential input items 
S , and having knowledge only of the ordering that is decided by the algorithm, 
removes input items from S according to a specific probability distribution, so 
as to generate the distribution over the actual inputs 4 . 

Problem definitions and input representations. The input to the makespan 
scheduling problem on m identical machines consists of a set of jobs, each having 
a certain size. The load of a machine is the sum of the sizes of jobs scheduled on 
it. The objective is to minimize the maximum load among the m machines. 

In the (uncapacitated, unweighted) facility location problem, the input con- 
sists of a set T of facilities and a set C of cities (we restrict our study to the 
case PnC = 0). Each facility i £ T is associated with an opening cost fi which 
reflects the cost that must be paid to utilize the facility. Furthermore, for every 
facility i € T and city j 6 C, the non-negative distance or connection cost Cij 
is the cost that must be paid to connect city j to facility i. The objective is to 

4 Note that an Adaptive Priority algorithm with randomized decisions will intro- 
duce randomization on the selection of the orderings as well: this is due to the fact 
that the next input to be considered depends on the outcome of processing inputs 
that were considered in the past. Hence, a similar game is not applicable in Adap- 
tive Priority algorithms. 
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open a subset of the facilities in T and connect each city in C to an open facility 
so that the total cost incurred, namely the sum of the opening costs and the 
connection costs, is minimized. We focus on the metric version of the problem, 
in which the connection costs satisfy the triangle inequality. 

When considering priority algorithms for the above problems, we need to 
establish a model for representing the input. For the makespan scheduling prob- 
lem, there is no issue as to what constitutes the input: the input is a set of 
jobs, with each job being characterized by its id and its size. In contrast, there 
exist two natural ways to represent the input to the facility location problem. 
In particular, we may assume that the input consists of facilities, where each 
facility is identified by a unique id, its distance to every city and its opening 
cost. When considering a facility of highest priority, the algorithm must make 
an irrevocable decision as to whether or not to open this facility. This model of 
input representation was assumed in [2] , since it abstracts several known priority 
algorithms (see the adaptive-priority algorithms in [21] and [20] as well as a fixed 
priority algorithm in [23]). We refer to the above representation of inputs as the 
facility-input model. 

A different way of representing the input is to view the cities as the input 
items. Namely, the input consists of the set of all cities, with each city being 
identified by its id, and its distance to each facility. The opening costs of the 
facilities are treated as global information. In this context, an irrevocable decision 
is interpreted as assigning each considered city to some open facility, possibly 
opening a new facility, but without affecting the assignment of cities considered 
in the past. We refer to this model of representing the inputs as the city-input 
model. 



3 Previous Work 

The first constant-factor polynomial-time approximation algorithm for (metric) 
facility location was given by Shmoys, Tardos and Aardal [30]. The past several 
years have witnessed a steady series of improvements on the approximability 
of this problem (the interested reader is referred to the survey of Shmoys [29]). 
Currently, the best-known approximation ratio (1-52) is due to Mahdian, Ye and 
Zhang [22], and is achieved by a non-priority algorithm. Guha and Khuller [17] 
have shown that the problem is not approximable within a factor better than 
1.463, unless NP C DTIME(n 0( - l ° slosn ' 1 ). 

We identify algorithms that can be described as priority algorithms. Mahdian, 
Markakis, Saberi and Vazirani [21] showed that a natural Adaptive Priority 
algorithm gives a 1.861 approximation; the analysis is performed by an elegant 
application of the dual-fitting technique. To our knowledge, the best approxi- 
mation achieved by a priority algorithm (as observed in [2]) is 1.61 and is due 
to Jain, Mahdian and Saberi [20] (also analyzed by using the dual fitting tech- 
nique). Mettu and Plaxton [23] showed that an algorithm which belongs in the 
class Fixed Priority Greedy yields a 3-approximation. Concerning the use 
of randomization, Meyerson [24] showed that a fully-randomized priority algo- 
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rithm can achieve a constant-factor approximation. In recent work Fotakis [12] 
presented a deterministic online algorithm with a competitive ratio of O( lo ^ > ^," w ) 
(here n denotes the number of nodes, and each node can be both a facility and 
a city); on the negative side, he showed that no randomized online algorithm 
can achieve a competitive ratio better than P ^'g^gn )? against the oblivious 
adversary. 

Makespan scheduling on identical machines is known to be strongly NP- 
lrard [13]. Hochbaum ans Shmoys [18] showed that it is possible to derive a 
PTAS, while Sahni [25] showed that, in the case where the number of machines 
to is fixed, there exists a FPTAS for the problem. For the variation of the problem 
in which jobs arrive on-line, Graham’s [15] famous List Scheduling algorithm is 
well-known to be (2 — A_)-competiti V e. Currently, the best competitive ratio 
achieved by a deterministic algorithm is 1.9201 [11], while the best known lower 
bound is 1.853 and it is due to Gormley et al. [14]. When randomization is 
introduced, a recent algorithm proposed and analyzed by Albers [1] achieves 
a competitive ratio of 1.916 for general to. Chen, van Vliet and Woeginger [9] 
and Sgall [27] showed that no randomized online algorithm can be better than 
1 _^ 1 j L — competitive. 

Concerning priority algorithms, Graham [16] showed that a simple algorithm 
called Longest Processing Time First (or LPT, for brevity), which sorts jobs by 
non-increasing size and assigns the current job to the least loaded machine, is 
a 1 -approximation ; note that LPT belongs in the class Fixed Priority, 
Greedy. Seiden, Sgall and Woeginger [26] showed that LPT is optimal for 
to = 2 when jobs are considered by non-increasing size (which they call semi- 
online scheduling) ; they also presented a ^-approximation semi-online algorithm 
with random decisions for to = 2, and showed it is optimal for the class of semi- 
online randomized algorithms when to = 2. Borodin et al. [8] showed that LPT 
is optimal with respect to all priority algorithms when to = 2. They also proved 
that LPT is optimal in the class Fixed Priority, Greedy, when to = 4, and 
that, for arbitrary to, no (deterministic) priority algorithm is better than a | 
approximation . 

4 Randomized Priority Facility Location 

In this Section we show lower bounds on the approximability of the metric 
facility location problem by randomized priority algorithms. All our lower-bound 
constructions for metric facility location use connection costs in {1, 3}, suggesting 
the following definitions. Let Cf be the set of cities at distance 1 from facility /. 
We say that / covers Cf. Two facilities / and /' are called complementary if and 
only if CfUCf = C (i.e., / and f cover all cities in the input), CffiCf = 0, and 
/ and f have the same opening cost. For convenience, we call f the complement 
of / (and vice versa), and we use the notation / to denote the complement of a 
given facility /. 
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4.1 Randomized Priority Facility Location 
in the Facility-Input Model 

Theorem 1. No fully-randomized priority algorithm for facility location has ap- 
proximation ratio better than |. 

Proof. Suppose we have n = 2 k cities, each with an id in {0, . . . , 2 fc — 1} in binary 
representation, and k pairs of complementary facilities as the set of potential 
facilities F. Each facility has an opening cost of ™ . The facilities are identified 
by the cities they cover in the following way: The i-th pair of complementary 
facilities, denoted by fi,f i with i < k is such that facility f t covers only cities 
whose i-tli bit is 0, while / i covers only cities whose i-tli bit is 1. The distance 
between a facility and a city it does not cover is equal to 3. By construction, any 
j facilities in F no two of which are complementary cover exactly n , 2~ l = 
n( 1 — 2-i) cities. 

Let A be a deterministic priority algorithm. We define a distribution over the 
actual inputs as follows. Only one of the k pairs of complementary facilities in 
F appears in the actual input, and that pair is chosen uniformly at random, i.e. , 
with probability ^ . For each of the remaining k — 1 pairs, only one of the two 
facilities will be in the actual input, and it is likewise chosen uniformly at ran- 
dom, i.e., with probability No other facilities are in the actual input. The 
optimal algorithm will always open the pair of complementary facilities, hence 
E [OPT] = 2j + n = Nf (for the remainder of the proof we refer to the two com- 
plementary facilities as the optimal facilities ). If A does not open both optimal 
facilities, the minimum cost it must pay is 2 n, and is achieved when A opens 
either two or three facilities (as shown in [2]). Clearly, A will pay at least 2 n if it 
opens more than 3 facilities, hence we may assume in what follows that A opens 
at most 3 facilities. 

The intuition behind the proof is that w.h.p. A will consider too many 
facilities for which it cannot have knowledge whether their complement is in the 
actual input. Since A can open only a small number of facilities, w.h.p. it will 
fail to open both optimal facilities. 

Lemma 1. The expected cost of A is at least (1 — e _ ™)(l — |) • 2 n. 

Proof. Let Hj be the set of facilities that A has considered before iteration j. 
Let gj denote the facility that would receive the highest priority in iteration j, 
assuming that the remaining set of facilities to be considered was F\Hj. For a 
fixed i, let ji be the earliest iteration of A in which gj = fi or gj = f t (note that 
such a ji always exists). We call the pair of (complementary) facilities (fi, /,) 
bad if and only if the facility gj is in the actual input. Note that if the pair 
(fi,fi) is bad, then at the iteration in which A considers gj , it cannot know 
whether its complement is in the actual input. 

Denote by B the set of bad pairs of facilities. Clearly, the pair of optimal 
facilities is always bad (regardless of the decisions of A), while each of the re- 
maining k — 1 pairs is bad with probability independently of every other pair. 
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Hence, we can say that \B\ = 1 + X , where X is the sum of k — 1 indepen- 
dent random variables, each of them equal to 1 with probability and 0 with 
the same probability. We can therefore apply the Chernoff bound, to get that 
Pr(|H| < |) < e _ ™. 

Let l denote the number of facilities that A opens, and recall that we can 
assume l < 3. Then, the probability A will open both optimal facilities is upper- 
bounded by the probability of success of any algorithm for the following game: 
Suppose there are \B\ lottery tickets, only one of them being the winning ticket 
(but the algorithm is oblivious of the winning ticket). The algorithm must go 
over all tickets, and every time it considers a ticket, either it buys it, or it ignores 
it; in either case, after the algorithm makes its decision about the current ticket, 
the adversary discloses the ticket to the algorithm (hence the algorithm gets to 
know whether the ticket was the winner). Last, the algorithm can buy at most l 
tickets. Clearly, the probability the algorithm buys the winner is (at most) ^ y 
This is also an upper bound on the probability A will open both optimal facilities 
conditioned on the event there exist \B\ bad pairs of facilities. Combining the 

above, we have that with probability at least (1 — 2e - ™)(l — ¥), A does not 

open both optimal facilities, hence its total cost is at least 2 n, as argued earlier 
in the proof. Summarizing, the expected cost of A is lower-bounded by 

(1 - e“™)(l - ' -) • 2 n. 

k n 

The theorem follows since for large values of n (and thus for k as well) the 
ratio E ^jp T ] can be made arbitrarily close to □ 

Theorem 2. There is a lower bound of 1.5 — e on the approximation ratio of 
every randomized (on the orderings) Fixed Priority Greedy facility location 
algorithm, for arbitrarily small e. 

Proof. Suppose we have a set C of n cities with id’s 1 . . . n. Define the set F of 
potential facilities as follows: For a fixed integer i £ [|] facility /) covers cities 
1 . . . | excluding city i, and also covers city f + L The set F consists of all ff s 
and /As, for i £ [|], and each facility has an opening cost equal to 2 — e, for 
arbitrarily small e. Recall that for every i, fi , f t are complementary facilities, 
and that the distance between a facility and a city not covered by it is equal 
to 3. 

Consider a deterministic Fixed Priority Greedy algorithm A, and denote 
by (respectively If) the potential input that consists of all facilities of the form 
f £ F (resp. f £ F) as well as facility f i (resp. /,), for i £ [|]. The actual input 
to A, which we denote by z, is selected uniformly at random from the set / U I, 
where / = {I,, i £ [§]} and I = {Ii,i £ [§]}, i.e., a specific actual input is chosen 
with probability equal to -. Since every potential input contains (exactly) one 
pair of complementary facilities, OPT can cover all n cities by opening the two 
complementary facilities, hence E [OPT] = n + 2(2 — e). Denote by Af the cost 
of facilities opened by A. Note that 
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E [A f ] = Pr (z £ I) ■ E[A f \z £ I}+Pr(z £ I) ■ E[A f \z £ /] 

= 1 -(E[A f \zGl] + E[A f \zeI}). (3) 

In order to show the wanted, we give expressions for E[dy |2 £ I] and E[Ty |,2 £ 
/]; the intuition is that not both expressions can be very small, hence E[Ty] is 
large. 

Lemma 2. E[A /|2 £ I] + E[A/|z £ I] > n( 1 — §). 

The theorem follows, since from (3) we have E[A/] > 1|(1 — §), and the 
expected total cost paid by A is at least 4^(1 — e'), for arbitrarily small e'. □ 

4.2 Randomized Priority Facility Location in the City-Input Model 

Theorem 3. No fully-randomized, priority algorithm for facility location in the 
city-input model is better than a 2-approximation. 

Proof. The set C of potential input items (cities) consists of n 2 cities. There are 
(" ) facilities, each covering (i.e., at distance 1) a set of n cities in C, and no 
two facilities cover the same set of cities. The distance from a given facility to 
every city it does not cover is 3, and the opening cost of every facility is equal 
to 2. Each of the n 2 cities in C will appear in the actual input with probability 
— . Thus, the expected number of cities in the actual input is n, and since there 
is a facility that covers these n cities, E[OPT] = n + 5. 

We will bound the expected cost of a deterministic algorithm A. First, note 
that every time a city c not covered by an opened facility is considered, it is to 
the algorithm’s benefit to open a new facility: the total cost of opening a new 
facility (say /) that covers c and connecting c to / is 3, which is the connection 
cost that must be paid if no facility is opened; therefore, we assume that when A 
considers an uncovered city it must open a new facility (this will only strengthen 
the bound). To reflect this action, we say that (the uncovered) city c opens 
facility /. 

Let Y be the random variable that denotes the set of facilities that A opens. 
We will show that the probability Y covers all cities and |y| < ^ — d^Jn (for 
constant d) is very small. 

2 

Lemma 3. Pr(T covers all cities and |T| < § — dy/n) < 4e~~s~. 

2 

Lemma 3 suggests that E[A\ > (1 — 4e~T ) • 2 • (| — dy/n) + n. Observe that 
for arbitrarily small e, one can find a constant d such that E ^jp T ] > 2 — e. □ 

5 Randomized Priority Makespan Scheduling 

The results we show in this Section hold under the assumption that the algorithm 
does not know the number of jobs in the input (the lower bounds for deterministic 
priority algorithms for the problem, shown in [8], rely on this assumption as well). 
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5.1 Fully- Randomized Priority Makespan Scheduling 

Theorem 4. No fully-randomized priority makespan scheduling algorithm has 
approximation ratio better than yy, for all m> 2. 

Proof. As in the proof of the deterministic | bound in [8] , the set S of potential 
jobs consists of 2[yrJ jobs of size 3, and 3|~yrl jobs °f size 2- If the actual input 
is S itself, the optimal makespan is 6, and is achieved by scheduling 2 jobs of 
size 3 on each of [yrj machines and 3 jobs of size 2 on each of the remaining 
fyy] machines. The following fact is shown in [8]: 

Fact 1 If the actual input set is S and the algorithm schedules two jobs of differ- 
ent sizes in the same machine, then the makespan of the algorithm is at least 7. 



We will bound the expected makespan of a deterministic algorithm A for a 
probability distribution over the actual inputs defined as follows: with probability 
p, to be determined later, the actual input to A is S, while with probability 1—p 
the actual input is a set S' C S, with |S"| = m+\ff ~| + 1 (S' can be any subset 
of S of the above size) . 

Lemma 4. At the point where A has scheduled precisely m+ f™] + 1 jobs, one 
of the following holds: 

• The current makespan is at least 6. 

• Two jobs of different sizes were scheduled on the same machine. 

• No two jobs of different sizes were scheduled on the same machine, and if 
l2,h is the number of machines on which jobs of sizes 2 and 3 were scheduled, 
respectively, then I2 > \ff\ or I3 > L^J- 

Denote the optimal makespan on input S' by opt(S'). It is not difficult to see 
that opt(S') € {4,5}. Note that E [OPT] =6 p+ opt(S')( 1 —p). To bound E[A] 
we consider cases concerning the behavior of the algorithm at the point where 
exactly m + [y] +1 jobs have been scheduled, as suggested by Lemma 4. 

• The (current) makespan of the algorithm is at least 6. Then E[A] > 6p + 6(l — 
P ) = 6 . 

• Two jobs of different sizes are scheduled on the same machine. Then E[A] > 
7 p + 5(1 — p) (from Fact 1). 

• No two jobs of different size were scheduled on the same machine, and I 2 > [y ] 
or h > |_tJ’ or ’ equivalently, I 3 < J or l 2 < |~yr]. On the event the actual 
input is S, this implies that either two jobs in S of different sizes are eventually 
scheduled on the same machine, or one of the following holds: A machine will be 
assigned at least 4 jobs of size 2, or a machine will be assigned at least 3 jobs 
of size 3 (depending on whether 1 3 > |_ttJ , or l 2 > respectively). In each 
case, the makespan of A is at least 7. Thus, E[A] >7 p + opt(S')( 1 — p). 

It follows that maxp mm gp^pt ’ 6 ^ with the constraint opt(S’) € {4, 5}, 
is a lower bound on the approximation ratio of A. The expression is maximized 
at p = which gives a lower bound of yf . □ 
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5.2 Fixed-Priority Makespan Scheduling 
with Randomized Decisions 

Theorem 5. No Fixed Priority randomized- decisions algorithm for make- 
span scheduling with m > 3 has approximation ratio better than // . 

Proof sketch. The set of potential jobs S is as defined in the proof of Theorem 4. 
In order to determine the appropriate input distribution, we observe the sequence 
a of the first m jobs in the ordering specified by a given deterministic Fixed 
Priority algorithm A, and consider cases for a. 

6 Oblivious vs. Adaptive Adversaries 

Wlrat if the adversary is not oblivious, but instead it can observe the outcome 
of the algorithm’s random orderings and/or random decisions? We call such an 
adversary adaptive 5 using, again, terminology from the analysis of online algo- 
rithms. Since the adaptive adversary has knowledge of the algorithm’s random 
choices, it works by removing input inputs from the set of potential input items 
based on the outcome of such choices. This means that the actual input to the 
algorithm, and hence the value of the optimal solution will both be random vari- 
ables. A priority algorithm is a c-approximation against an adaptive adversary 
if E[_4 (ct)] <c-E[OPT(ct)]. 

The following theorem provides some evidence that in the context of priority 
algorithms, the adaptive adversary may be too powerful to be of any interest 
(reminiscent of the online world, where there is no advantage to using random- 
ization against adaptive-offline adversaries [5]). 

Theorem 6. Suppose that, using the techniques of [8], we can show (construc- 
tively) that a problem is not c-approximable by a deterministic priority algorithm. 
Then the problem is not c-approximable by any randomized priority algorithm 
against the adaptive adversary. 

7 Open Questions and Future Directions 

Apart from improving the lower bounds, if possible, it would be very interesting 
to use the intuition behind the lower-bound constructions so as to design better 
randomized priority algorithms. For instance, would our results suggest better 
upper bounds for randomized priority makespan scheduling? 

We believe that the framework is applicable to a wider class of optimization 
problems; in particular, we believe that it can be applied in the model of Im- 
pagliazzo and Davis [19], so as to address the power of randomized greedy-like 
algorithms for graph-related problems. Finally, several questions that have been 
asked in the context of the competitive analysis of online algorithms can be asked 
about the class of algorithms we study. For instance, how critical is memory in 

5 Not to be confused with adaptive priority algorithms. 
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randomized priority algorithms? (see [6] for a discussion of the situation in the 
online world). Also, can we extend Theorem 6 to show that if a randomized pri- 
ority algorithm is a c-approximation against the adaptive adversary, then there 
exists a deterministic c-approximation priority algorithm? 
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Abstract. We investigate the problem of routing traffic through a con- 
gested network in an environment of non-cooperative users. We use the 
worst-case coordination ratio suggested by Koutsoupias and Papadim- 
itriou to measure the performance degradation due to the lack of a cen- 
tralized traffic regulating authority. We provide a full characterization 
of the worst-case coordination ratio in the restricted assignment and 
unrelated parallel links models. In particular, we quantify the tradeoff 
between the “negligibility” of the traffic controlled by each user and the 
coordination ratio. We analyze both pure and mixed strategies systems 
and identify the range where their performance is similar. 

1 Introduction 

Overview: In most communication networks it is infeasible to maintain one cen- 
tralized authority to route traffic efficiently. As a result, users may decide in- 
dividually how to route their traffic. Each user behaves selfishly in the sense 
that he wishes to minimize his transfer cost while being aware of the network 
congestion caused by other users. A system of users decisions is said to be in a 
Nash equilibrium if no user can benefit from changing his decision. Simple exam- 
ples in game theory show that the performance of systems in Nash equilibrium, 
achieved by non-cooperative users, can be far from the global optimum. Re- 
cently, the question of quantifying the decrease in network performance caused 
by the lack of a centralized authority, received considerable attention among 
researchers. Koutsoupias and Papadimitriou [5, 8] suggested to investigate the 
worst-case coordination ratio, which is the ratio between the worst possible Nash 
equilibrium and the global optimum, as a mean to understand the cost incurred 
due to the lack of a centralized regulating authority. 

Problems of this type have been studied lately by two approaches. The worst- 
case coordination ratio in a network composed of m parallel related links was 
analyzed in [4, 5,8], while making no assumptions on the relative amount of traf- 
fic controlled by each user. On the other hand, [11, 12] obtained improved bounds 
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** Research supported in part by the Israeli Ministry of industry and trade. 



R. Solis-Oba and K. Jansen (Eds.): WAOA 2003, LNCS 2909, pp. 41—52, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 




42 



Baruch Awerbuch et al. 



for general networks while assuming that each user controls only a negligible frac- 
tion of the total traffic. We attempt to bridge between these two approaches, by 
analyzing the worst-case coordination ratio as a function of the relative fraction 
of the total traffic controlled by each user. Specifically, we quantify the required 
“negligibility” of each user’s traffic, needed to bound the worst-case coordination 
ratio by a constant. 

We focus on two new models. First, we study the restricted assignment model 
(also called the subset model) which is defined as follows. The network consists of 
m parallel links, there are n users where user j has amount of traffic Wj, that can 
be transferred through any link from a subset Sj of the m links. We consider 
both the pure strategies case where each user selects one link to transfer his 
traffic, and the more general case of mixed strategies where each user decides a 
probability distribution over his allowable links. In both cases each user is aware 
of the decisions made by other users. Each user behaves selfishly and wishes to 
minimize his cost by assigning his traffic to the least loaded link. The global 
objective however, is to minimize the load of the most loaded link. We note that 
even though this model is a simplification of real communication networks, it 
captures the essence of basic networking problems as pointed out by [5, 6, 8]. 

Additionally, we study the more general model of unrelated links in which 
a task j (j = 1 .... , n) is associated with an m - vector Wj specifying its weight 
on each link. We analyze the worst-case coordination ratio as a function of the 
maximum stretch s in the system, where s = ma <oc . 

Our results: We provide a full characterization of the coordination ratio for 
both pure and mixed strategies in the restricted assignment and unrelated links 
models. Specifically, we prove the following results: 

— We show the following tight bounds for the restricted assignment model, as 
a function of the ratio r between the optimal assignment and the largest 
task: 

• For 1 < r < log m we prove that the coordination ratio is bounded by 

0 ( r.log 1 ( 0 l S +^) ) hl the PUre Strate S ieS C3Se ’ aild ^ 0 ( , -log log(l+ ■■*-) ) 

in the mixed strategies case. 

• For r = j7( lo S 2 m ) we show that the coordination ratio for both pure and 
mixed strategies systems is bounded by 1 + e. 

Note that general bounds of <9( lo for pure strategies and ®( log l° g ™ gm ) 

for mixed strategies are obtained when r = 1, i.e. when making no assump- 
tions on the largest task in the system. We also note that for r = f?(log?n) 
the coordination ratio is bounded by a constant, for both pure and mixed 
strategies. 

— In the unrelated links model we prove that the coordination ratio is bounded 
by (s + loc( ‘°^g t j ) in pure strategies systems, and 6>(s+ log(1 " ~) 
when mixed strategies are allowed. 

Some of our results are obtained by extending the techniques used in [4] to 
the restricted assignment and unrelated links models. 
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Related work: Koutsoupias and Papadimitriou [5] initiated the study of worst- 
case coordination ratio in networks composed of m-parallel related links with 
possibly different speeds. They first investigated the case of two links and proved 
a worst-case coordination ratio of 3/2 for the case of identical links, and (f> = 
1+ 2 f° r links with possibly different speeds. They also obtained non-tight 
bounds for the general case. Mavronicolas and Spirakis [6] continued this line of 
research while focusing on the special case of fully-mixed strategies in which the 
probability of assigning any task to any link is non-zero. They proved that in 
this case the worst-case coordination ratio is bounded by 6>( ) for both 

the identical links model and the general related links model where all tasks have 
equal weights and m < n. Czumaj and Vocking [4] proved tight bounds for the 
m-parallel related links model, and showed that the worst-case coordination ra- 
tio is bounded by in the identical links model and by <9( log }°^ gm ) 

in the general related links model. Czumaj et al. [3] continued to study this 
problem and characterized the coordination and bicriteria ratios for different 
families of cost functions. 

Rouglrgarden et al. [10, 12] also examined the degradation in network perfor- 
mance due to unregulated traffic. Their model deals with general networks where 
users adopt pure strategies only, and the amount of traffic of each user is assumed 
to be a negligible fraction of the total traffic. The objective is to minimize the 
total latency. They proved that when the latency of each edge is a linear function 
of the edge congestion, any flow at Nash equilibrium has total latency at most 
4/3 of the optimal flow. Although for general latency functions the coordination 
ratio is unbounded, the following bicriteria result can be shown: for any network 
with continuous non-decreasing latency functions, a flow at Nash equilibrium 
has total latency no more than that of an optimal flow forced to route twice as 
much traffic. Rouglrgarden [11] also showed that the cost of unregulated traffic 
does not depend on the complexity of the network topology. He also studied the 
impact of latency functions belonging to specific classes. Rouglrgarden et al. [1, 
2,9] studied various ways to construct and price networks such that the cost 
incurred in unregulated traffic is minimized. 

Paper structure: The paper is organized as follows. Section 2 includes formal 
definitions and notations. The restricted assignment model is studied in section 3. 
In section 4 we analyze the unrelated links model. 



2 Definitions and Notations 

The restricted assignment model is defined as follows: there are m parallel links 
and n users, where user j (j = 1, ... , n) has a task with weight Wj, that can be 
assigned to any link from a subset Sj of the m links. We denote the largest task 
in the system by w max = maxi <j< n Wj. Given an instance of the problem we 
define the global optimum (denote it by OPT) to be the assignment of tasks to 
links that minimizes the maximum load of a link. We denote the ratio between 
the value of the optimal solution and the largest task by r = °) PT . The unrelated 
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links model is more general. Task j (J = 1 , , n) is associated with an to - vector 
Wj , where Wij indicates the weight of task j on link i. 

We assume that the users are non-cooperative and each one wishes to mini- 
mize his own cost with no regard to the global optimum. We consider two types 
of users strategies systems: 

1. Pure strategies: user j selects link lj £ Sj and assigns his task to it. Each 
user is aware of the choices made by all other users when making his decision. 

2. Mixed strategies: user j selects a probability distribution {ptj} ( i £ Sj) 
over the allowable set of links for task j. Each user is aware of the probability 
distributions selected by all other users. 

For the remainder of this section we regard pure strategies as a special case 
of mixed strategies, and describe our definitions in terms of mixed strategies 
systems. Given a system S of mixed strategies with probability distributions 
{pfj}, we define the following random variables: 

— A set of indicator random variables {Xf}, where Xf indicates whether task 
j is assigned to link i. By definition: Pr[Xfj = !] =Pij- 

— For each link i (i = 1, . . . ,to) we define a random variable Lf , indicating 
the total load on the link: Lf = Yflj=i w i ' Ac/- We denote the maximum 
expected load by p s = maxi<j< m E[L?}. 

— We define a random variable Lf nax = maxi<j< m Lf to indicate the maximum 
link load, and denote its expectancy by ^ aa , = £[Lf, ra ]. Clearly, /x^ Q2 . > 

For simplicity of notation, throughout the paper we omit the superscript S 
when meaning is clear from context. 

Definition 1. The expected cost of user j for assigning his task to link i in 
system S is defined as: cf = E[Lf ) + (1 ~pf)wj. 

Definition 2. A system S is said to be in Nash Equilibrium if and only if for 
every task j and link i, pf > 0 only if i = argmini<fc< TO cjy. 

Definition 3. The worst-case coordination ratio of an instance of the problem 
is defined as R = maxg opt ’ w ^ ere ^ e maximum is taken over all strategies 
systems S in Nash equilibrium. 



3 Worst-Case Equilibria 

in the Restricted Assignment Model 

In this section we provide tight bounds for worst-case equilibria in the restricted 
assignment model. We investigate both pure and mixed strategies. We first show 
upper bounds for the problem, and then provide matching lower bounds. 
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3.1 Upper Bounds for Restricted Assignment 

We begin by proving an upper bound on the maximum expected load in any 
system in Nash equilibrium. Recall that r = OPT , without loss of generality we 
assume that r is integral. 

Lemma 1. Let S be any system in Nash equilibrium. Then / jl s < ((3+ I) • OPT , 
where f3 satisfies the inequality: e(^) 13 < m~ . 

Proof. We order the links by non-increasing order of their expected loads. For 
every k > 1, define to* to be the minimal integer such that E[L m + i] < k-w max 
(mi, = m if there is no such integer). We define l = I I. Note that nil > 1. 

v 7 L 31) max J 

The next claim states an important property regarding the relation between m k 
and rrik+i- 

Claim 1. For every k> 1, — — > h+1 . 

Proof. Denote by Jk+i the set of tasks with positive probability to be assigned 
to any link from [1, . . . , m.k+ 1 ], and denote the total weight of tasks from Jk+i by 
W(Jk+ 1 ) = +1 UJ j ■ Consider the way OPT schedules the tasks from Jk+i- 

OPT can not assign a task from Jk+i to a link with index larger than m.k- Let 
us assume, for contradiction, that OPT assigns task j G Jk+i to link t > ruk- In 
addition, we assume that in our system p q j > 0 for some link q < nik+i- There 
follows: 



Cqj — (1 Pqj^^j — T 1 )w max T (1 Pqj'l'Mj T {k 1 )'W rnax 

> k ■ w max + (1 - Ptj)wj > Ctj , 



where the first inequality follows from the definition of q and the third inequality 
results from Wj < w max (j = 1 , . . . , n) . This contradicts the fact that the system 
S is in Nash equilibrium. Hence, OPT must assign all tasks from Jk+i to links 
in the range [1 ,...,mfc]. Let L° PT denote the load on link i in the optimal 
assignment. Recall that w max = We conclude that 

m m + 1 

m k ■ OPT > L ? PT > W(J k+ 1) > E \- L <\ ^ ( fc + !) 

i= 1 i= 1 






i ? 



which yields the desired inequality. □ 

We use Claim 1 iteratively together with the inequality (n!/fc!) > ^'/ej to 
obtain the following: 



in > m r > 



1(1-1). ..(r+1) 



Id 



r l—r 



■! . r l-r 



> 






(i) r 



r l—r 



Substituting l = (3-r we get: e(^)P < vnf! r and p s < (l + l)QET. = (/3+ I) OPT. 

□ 
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Pure Strategies. In the following theorem, derived from Lemma 1, we show 
an upper bound on the worst-case coordination ratio as a function of the ratio 
r between the optimal solution and the largest task in the system. 

Theorem 1. For pure strategies, when 1 <r< log m, R = O ( ~ log (°+™g — y) • 

As a direct result from Theorem 1, we obtain the following general upper 
bounds for two extreme cases. In the first case we make no assumptions regarding 
the amount of traffic controlled by each user, i.e. r = 1. In the second case the 
tasks are relatively small, i.e. r = 0(logm). 

Corollary 1. For pure strategies, R = 0( ) • 

Corollary 2. For pure strategies, when r = 0(logm), we have R = 0(1). 

Next, we study the coordination ratio in networks where each user controls 
a small fraction of the total traffic, i.e. r = L?(logra). The next theorem shows 
that in such networks the coordination ratio is close to 1. 

Theorem 2. For any 0 < e < 1, if r = f2( lo | ™ ), then R< 1 + e. 

Proof. From Lemma 1 we know that R < f3 + £ where (3 satisfies the inequality: 
e(-)@ < m 1 / r . When taking r = I?( Ios 2 m ) we obtain using Taylor expansion: 

e 0 < t2 ) > m 1/r > e = e i(^-i) a -o(0J-i) 8 ) ) 

and the inequality yields R < (3 + \ < 1 + e. □ 



Mixed Strategies. The following theorem gives an upper bound on the coor- 
dination ratio for the mixed strategies case. 

Theorem 3. For mixed strategies, R = O ( lo f m i 0 . — ) ■ 

\ r-log log ( 1 H — ) / 

Proof. For convenience we scale the tasks weights such that w max = 1. Consider 
an arbitrary link i. Recall that Li = ^” =1 WjX^, where Wj < w max = 1 for 
j = 1 ,...,n, and p = maxi<j< TO E[Li\. We apply Hoeffding inequality and 
obtain for every c > 1: 



Pr [Li > c • p\ < 





We can now apply the union bound to obtain: Pr[L max > c- p\< mfe/c )°' Al . For 
every integer a > 1 we can upper bound the expected maximum load by 



OO 

l^max Oi ' [l ^ ^ Pi \Lrnax ^ k * 

k—a 
oo 

< a • fi + m • y^(e/fc) fc ' M < a • n + 2 m • fi • (e/a) Q: ‘ M , 

k—a. 
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where the last inequality follows from the fact that the sequence is sub-geometric. 
We can substitute a for where T _1 is the inverse Gamma function 

and r~ 1 (x) = <9( l0 g°f 0 g We then have: (e/a)“' M < 1/m, hence p, max < (a + 

2) ii. Since a = O ^ there follows: ii max = O ^ ^ ■ Since OPT > 

r, Theorem 1 implies that /i = O — y) , substituting p we conclude that 

Umax = O ( loglo g°®™i£Ly) and therefore R = 0 ( r ^ ogl ^f 1 jT 5 ^y) ■ □ 

The following upper bounds for two extreme cases are derived directly from 
Theorem 3. 

Corollary 3. In the restricted assignment problem R = O( log log™ gm )- 

Corollary 4. When r = 6>(logm) we have R = 0(1). 

The next theorem refers to networks where each user controls a small fraction 
of the total traffic. 

Theorem 4. For any 0 < e < 1, if r = f?( lo | 2 m ), then R < 1 + e. 

Proof. For convenience we scale the tasks weights such that w max = 1, hence 
OPT = r. By the Clrernoff bound, for every 1 < i < to, and 0 < e < 1: 

Pr[L, > (1 + e/3)/x] < Pr[ L t + (/r - E[L t )) > (1 + e/3)/z ] 

2 

< < m~ 3 , 

where the first inequality follows since p — E[Lf\ > 0, the second inequality 
results from the fact that E[Li + (/z — E[Li])\ = /i, and the last inequality 
follows from p > OPT = r. By applying the union bound, with high probability 
L m ax < (1 + e/3 and therefore iL max < (1 + e/3 )/i. By using Theorem 2 with 
e/3 we conclude that Umax < (1 + e/3)(l + e/3) OPT < (1 + e)OPT and the 
theorem follows. □ 



3.2 Lower Bounds for Restricted Assignment 



We begin by proving a tight lower bound for the pure strategies case. We note 
that our lower bounds are proved even for unit tasks. 



Theorem 5 



. For pure strategies, R = 12 ^ 



log m 



r-log(l+ log ) 



Proof. We construct the following problem instance: 

links: we partition the m links into l + l groups (the value of l will be determined 

• links. Denote the 



. , l there are yjm 



later) such that in group k = 0, . 
number of links in group k by n^. 
tasks: we partition the tasks into l groups. In group k = 1, ..., I there are k 
unit tasks, each can be assigned to any link from groups [k — 1, . . . , l]. 



n k 
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Observe that OPT < r + 1 for this problem instance. The optimal solution 
assigns the tasks of group k (k = 1, . . . , l) to the links in group k — 1, at most 
r + 1 tasks per link. We define the following system of pure strategies, denote it 
by S: all tasks from group k (k = 1, . . . , l) are assigned to links from group k, k 
tasks per link. 

Claim 2. The system S is in Nash equilibrium. 

Proof. Denote by (k = 1, . . . , l) the set of links to which tasks from task group 
k can be assigned. Let j be a task from group k and consider the assignment of 
j to link i from link group k. Clearly, Cij = k, and for each link t € Sk we have 
Ctj > (k — 1) + 1 = k > Cij. Hence the system S is in Nash equilibrium. □ 

We now turn to bound the coordination ratio. Since the number of links is m 
we should satisfy: 



m > 



l\ vA r k 



=Vm 

fc= l 



l\ 



using Stirling’s formula we bound the last expression from above by 



\frn ■ 



(i+o(i))V2Td.(ty 

j • e < m. 



and by taking l = (3r, we conclude that R = J?(^) = l?(/3) = i? ( ~ log (°+™g — y) ■ 

□ 



Using Theorem 5, we can derive a tight lower bound (for pure and mixed 
strategies) for the case where each user controls a small fraction of the total 
traffic. The proof is omitted. 

Theorem 6. For any 0 < e < 1, if r = O( lo f 2 m ), then R > 1 + e. 

We can modify our construction from Theorem 5 to obtain a tight lower 
bound for the mixed strategies case. 

Theorem 7. For mixed strategies, R = ft ( ~ log ^ i og — y) • 

Proof. We slightly modify the problem instance constructed in the proof of The- 
orem 5. Everything remains the same except tasks group l now contains (Z — l)-nj 
tasks. Clearly, OPT < r + 1. We introduce the following system of mixed strate- 
gies, denote it by S: 



— All tasks from group k (k = 1, . . . , l — 1) are assigned to links from group k, 
k tasks per link (the same as the construction in the proof of Theorem 5). 

— Each task from group l has uniform distribution over the links from group l. 



We first prove that the system S is in Nash equilibrium. 
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Claim 3. The system S is in Nash equilibrium. 

Proof. The cost of task j from group l on any link i in group l is: c-ij = {l — 1) + 
(1 — 1 /y/rn). On the other hand, for any link t not in group l : c t j = (l— 1)+1 > c^. 
The proof concerning tasks from groups [1, — 1] is identical to the proof 
given in Claim 2. □ 

In the following claim we determine a lower bound on Umax- 
Claim 4. Mmax = ^ (logloga + l^))- 

Proof. Consider the assignment of tasks to links in group l. There are (l — 1) • yfrn 
unit tasks each with uniform distribution over the ypm links. This corresponds 
to a model of throwing (l — 1) • ypm balls uniformly to yfrn bins (see e.g. [7]). 

In this model the expected maximum occupancy is: Q (l + i og ((io^; ? m)/i) ) • U our 
case this lower bound corresponds to /x^ ax = fl ( log loj |° f ™io g — y) • □ 

Since OPT < r + 1, our lower bound on the coordination ratio in the case of 
mixed strategies follows directly from Claim 4. □ 



4 Analysis of the Unrelated Links Model 

Recall that in the unrelated links model, a task j is associated with an m-vector 
Wj = (wij, . . . ,w m j) specifying its weight on each link. Clearly, this model 
generalizes the restricted assignment model. We define the maximum stretch in 
the system as s = ma Xj t ij :w <00 . In the next sections we show tight bounds 

for pure and mixed strategies as a function of s. Note that by increasing s the 
coordination ratio becomes arbitrarily large. 



4.1 Upper Bounds for the Unrelated Link Model 

We begin by proving an upper bound on the maximum expected load in any 
system in Nash equilibrium. 

Lemma 2. Let S be a system in Nash equilibrium. Then 



fi S = o 



log m 



log(l 



log m \ 



■ OPT. 



Proof. We order the links by non-increasing order of their expected loads. For 
every k > 1, define to be the minimal integer such that E[L m + i] < k ■ OPT 
( mfe = to if there is no such integer). Let h = [[I s / OPT\ . The following claim 
shows a relation between ink and mk+i- 

Claim 5. For every k > 1, — — > 

& — ' m -|-i — s 
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Proof. Denote by Jk+i the set of tasks assigned by S with positive probability 
to a link from [1, . . . , rrik+ 1 ], and W (Jk+i) is the total weight of the tasks from 
Jk+i under the assignment of S. Suppose that there is a task j £ Jk+i which 
is assigned by OPT to a link t > rrik, and let q < rrik+i be a link to which j is 
assigned by S with positive probability. Then, 



c q j > ( k + 1) • OPT > k ■ OPT + (1 — Ptj)wtj > c t j, 



contradicting the assumption that S is in Nash equilibrium. Therefore, all the 
tasks in Jk+i are assigned by OPT to links from [1, . . . , to*,]. Hence, 



m k 



OPT >Y^ L° PT > 



W{J k+ i) (A: + 1) • OPT 



> 



^fc +1 ? 



and the claim follows. 

Now, if ft < s then we are done. Otherwise, using Claim 5 we obtain that 
ft(ft-l) ■■■(« + !) ft! _ / l 



□ 



to > m s > 



c h—s 



I . s 1 gl s/hg 



It follows that ft = O ( 1=1 

The next two theorems bound the worst-case coordination ratio for pure 
and mixed strategies. Theorem 8 follows directly from Lemma 2. The proof of 
Theorem 9 is by a similar analysis to the one used in the proof of Theorem 3, 
and is omitted. 

Theorem 8. For pure strategies R = O (^s + log ^iT g — y) • 

Theorem 9. For mixed strategies R = O ( s + - — 5 ’ 1 ° gm l0 . — - ) . 

V log(l+s-log — S— ) ) 

4.2 Lower Bounds for the Unrelated Link Model 

In this section we show matching lower bounds. 

Theorem 10. For pure strategies, R = f2 (s + — y) ■ 

Proof. We prove the theorem by constructing two instances of the problem. 
Without loss of generality we assume that s is integral. 

We first assume that s < log m. We give a construction similar to the one in 
Theorem 5: We partition the links into Ks + 1 groups, where K < ). 

For k = 0, . . . , K — 1 and i = 1, . . . , s, group number ks + i contains nk s +i = 
y/m. ■ links, and group 0 contains no = to — ni links. Note that 
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fcs+l K - 1 

■(K\) s + ^2 n l < y/rn- (A'!) s + \fm- ^ s 

l-l k - 0 

so no > y'ro • (A'!) s . 

The tasks are partitioned into Ks groups. For k = 0, . . . , K — 1 and i = 
1, . . . , s, the (ks + i)-th group contains (k + 1) • n^s+i tasks. Each task in that 
group has weight 1 on a link from group ks + i— 1, weight s — (s — i)/(k + 1) on 
a link from group ks + i , and an infinite weight on all other links. 

The optimal assignment is to assign each task from the Z-th group of tasks to 
a distinct link in the (Z — l)-tli group of links. Thus, OPT = 1. Now, consider the 
following system S of pure strategies: Each task from the Z-th group is assigned 
to a distinct link in the Z-th group of links. Clearly, the load on a link from the 
Z-th group is exactly Z for Z > 1, and in particular, the maximum load is Ks. 

The system S is in Nash equilibrium: For a task j from the Z-th group that 
was assign to a link i , we have that c t j = Z, and for any link k yf i, Ckj > 
(Z — 1) T 1 — Z. Therefore, the coordination ratio is f2(Ks). Since we can take 

K=n(l + Jjjgfe.) . it follows that *-«(,+ sjgfc;) . 

We now handle the case when s = l?(logm). Consider the following problem 
instance: there are to tasks, where task j has weight 1 on link j, weight s on link 
(( j + 1) mod to), and an infinite weight on any other link (assume the links are 
numbered [0, . . . , m — 1]). 

Clearly, OPT = 1 by assigning task j to link j (j = 0, . . . , m — 1). Consider 
the system of pure strategies S where task j is assigned to link (( j + 1) mod to). 
Clearly, the system S is in Nash equilibrium. Moreover, the load on each link is 
s. Hence, the coordination ratio is f?(s). □ 



m s 

(k\y 



< y/m-(Kl) s (1 + es) < 



We can modify the construction from Theorem 10 and obtain a tight lower 
bound for the mixed strategies case. The proof relies on the same arguments 
used in the proof of Theorem 7 and is omitted. 

Theorem 11. For mixed strategies, R = fl ( s + - — s ' l ° gm lo „ — - ) . 

V log(l+s-log - 2 *— )/ 
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Abstract. We consider the on-line load balancing problem where there 
are m identical machines (servers). Jobs arrive at arbitrary times, where 
each job has a weight and a duration. A job has to be assigned upon 
its arrival to exactly one of the machines. The duration of each job be- 
comes known only upon its termination (this is called temporary tasks 
of unknown durations). Once a job has been assigned to a machine it 
cannot be reassigned to another machine. The goal is to minimize the 
maximum over time of the sum (over all machines) of the squares of the 
loads, instead of the traditional maximum load. 

Minimizing the sum of the squares is equivalent to minimizing the load 
vector with respect to the I 2 norm. We show that for the £2 norm greedy 
algorithm performs within at most 1.50 of the optimum. We show (an 
asymptotic) lower bound of 1.33 on the competitive ratio of the greedy 
algorithm. We also show a lower bound of 1.20 on the competitive ratio 
of any deterministic algorithm. 

We extend our techniques and analyze the competitive ratio of greedy 
with respect to the l p norm. We show that the greedy algorithm performs 
within at most 2 — 12(1 /p) of the optimum. We also show a lower bound 
of 2 — 0(lnp/p) on the competitive ratio of any on-line algorithm. 



1 Introduction 

We are given m parallel identical machines and a number of independent jobs 
(tasks) arriving at arbitrary times, where each job has a weight and a duration. A 
job should be assigned upon its arrival to exactly one of the machines based only 
on the previous jobs without any knowledge on the future jobs, thus increasing 
the load on this machine by its weight for the duration of the job. The duration 
of each job becomes known only upon its termination (this is called temporary 
tasks of unknown durations). The load of a machine is the sum of the weights 
of the jobs assigned to it. For any £ p norm we define the cost of an assignment 
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for an input sequence of jobs as the maximum over time of the £ p norm of the 
load vector. Specifically, the £oo norm is the makespan (or maximum load) and 
the 1 2 norm is the Euclidean norm, which is equivalent to the sum of the squares 
of the load vector. The goal of an assignment algorithm is to assign all the jobs 
so as to minimize the cost. 

Consider for example the case where the weight of a job corresponds to its 
machine disk access frequency. Each job may see a delay that is proportional to 
the load on the machine it is assigned to. Then the average delay is proportional 
to the sum of squares of the machines loads (namely the £2 norm of the corre- 
sponding machines load vector) whereas the maximum delay is proportional to 
the maximum load. 

We measure the performance of an on-line algorithm by its competitive 
ratio. An on-line algorithm is c-competitive if for each input the cost of the 
assignment produced by the algorithm is at most c time larger than the cost of 
the optimal assignment. 

We first summarize our results. 

— For the £ 2 norm, we show that the greedy algorithm is at most 1.493- 
competitive (2.2293-competitive for the sum of the squares of loads) for any 
number of machines. In fact, for in = 2 greedy algorithm is optimal and its 
competitive ratio is 1.145 and for m = 3 we can improve the upper bound 
to 1.453. 

— For the £2 norm, we show that there is no on-line algorithm that is 1.202- 
competitive (1.447-competitive for the sum of the squares of loads). 

— For the £2 norm, we show a lower bound of — O(-) on the competitive 
ratio of any on-line algorithm for m machines and we show a lower bound 
of -^= for to divisible by 3. 

— For the £2 norm, we show (an asymptotic) lower bound of 1.338 on the 
competitive ratio of the greedy algorithm. 

— For the general £ p norm (for any p > 1) , we show that the greedy algorithm 
is at most 2 — L?(l/p)-competitive for any number of machines. 

— For the general £ p norm (for any p > 1), we show (an asymptotic) lower 
bound of 2 — 0(lnp/p) on the competitive ratio of any on-line algorithm. 

— For the general i p norm (for any p > 1) , we show that for m=2 the greedy 
algorithm is an optimal on-line algorithm. 

Temporary tasks, norm: For the problem of on-line load balancing of 
temporary tasks the upper bound is 2 — A This upper bound was proved for 
permanent tasks (tasks that never depart) by Graham [12], nevertheless Gra- 
ham’s analysis of the upper bound holds also for temporary tasks. The results 
in [4] show that his algorithm is optimal by constructing a lower bound of 2 — A 
on the competitive ratio of any deterministic on-line algorithm. 

Permanent tasks, norm: The permanent tasks model is the model where 
tasks only arrive (on-line), but never depart. This is the classic ancient problem 
of scheduling jobs on identical machines minimizing the makespan (or maximum 
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load). Graham [12] showed that the greedy load balancing algorithm is 2— ~ 
competitive in this case. The greedy algorithm is an optimal on-line algorithm 
only for m < 3 [9]. 

Bartal et al. [6] were the first to show an algorithm whose competitive ratio 
is strictly below c < 2 (for all m). More precisely, their algorithm achieves 
a competitive ratio of 2 — Later, the algorithm was generalized by Karger, 
Phillips and Torng [16] to yield an upper bound of 1.945. Subsequently, Albers [1] 
designed 1.923-competitive algorithm. Fleischer and Wahl [10] improved this 
result to a ratio of 1.9201. 

Bartal et al. [7] showed a lower bound of 1.8370 for the problem. This result 
was improved by Albers [1] to a ratio of 1.852 and then by Gormley et al. [11] 
to a ratio of 1.853. The best lower bound currently known is due to Rudin [15], 
who showed a lower bound of 1.88. 

Permanent tasks, l p norm: Chandra and Wong [8] were the first to consider 
the problem of minimizing the sum of squares of the machine loads. They showed 
that if the jobs arrive in non-increasing weights order then the greedy algorithm 
yields a schedule whose cost is within || of the optimal cost. This result was 
slightly improved by Leung and Wei [17]. Chandra and Wong [8] also considered 
the general t p norm (for any p > 1) and showed that the greedy algorithm on 
the sorted items achieves a constant performance bound. The constant depends 
on p and grows to | when p grows to oo. The problem of on-line load balancing 
in the general £ p norm (for any p > 1) for permanent tasks was considered in [3]. 
The results in [3] show that for the sum of the squares, the greedy algorithm 
performs within | of the optimum, and no on-line algorithm achieves a better 
competitive ratio. For the sum of the squares [3] also provided on-line algorithm 
with competitive ratio | — <5, for some fixed 5, for any sufficiently large number 
of machines. For the general £ p norm the results show that the competitive ratio 
of greedy algorithm is 2 — (9 ((hip) jp). 

Off-line results: Azar et al. [5] presented a polynomial time approximation 
scheme for the problem of off-line load balancing of temporary tasks (in the too 
norm) in the case where the number of machines is fixed. For the case in which 
the number of machines is given as part of the input (i.e, not fixed) they showed 
that no polynomial algorithm can achieve a better approximation ratio than | 
unless P = NP. 

For the problem of off-line load balancing of permanent tasks (in the too 
norm), there is a polynomial time approximation scheme for any fixed number 
of machines [13, 18] and also for arbitrary number of machines by Hoclrbaum 
and Slrmoys [14]. 

Off-line scheduling and load balancing of permanent tasks with respect to 
the £ p norm has been considered in [2]. The off-line minimization problem is 
known to be NP-lrard in the strong sense. Alon et al. [2] provided a polynomial 
approximation scheme for scheduling jobs with respect to the £ p norm for any 
p > 1. An example in which the optimal assignment for the sum of the squares 
is different than the optimal assignment in the too norm is also given in [2]. 
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2 Definitions and Preliminaries 

In the load balancing problem we are given m identical machines (servers) 
and a finite sequence of events. We denote the input sequence by a = <ti, . . . , ay. 
Each event cq is an arrival or departure of a job (task) . We view a as a sequence 
of times, the time cq is the moment after the i— event happened. We denote the 
weight of a job j by Wj, its arrival time by a,j and its departure time (which is 
unknown until it departs) by dj. An on-line algorithm has to assign a job upon 
its arrival without knowing the future jobs and the durations of jobs that have 
not departed yet. We compare the performance of on-line algorithms and an 
optimal off-line algorithm that knows the sequence of jobs and their durations 
in advance. Let Jj = {j | cij < <Ji < dj} be the active jobs at time cq. A 
schedule S is an assignment which assigns each job j to a single machine k, 
1 < k < m. For every schedule S, the load of machine k at time cq, denoted 
L l k (S), is the sum of weights of all jobs assigned to machine k in S, and active 
at this time. The vector of loads at time cq is L l (S) = ( L\(S ), . . . ,L l m (S)). 
Our cost measure is the l v norm. Hence the cost of a schedule S at time cq is 

defined as ||L*(/S)|| P = (X)fcLi(^fc(^)) P ) ■ The cost of a schedule S is defined as 
||L(5)|| P = max,; ||L*(S')|| p . We denote the load vector with the maximum cost, 
by L(S) = (L 1 (S),...,L m (S)). 

The optimal cost, denoted OPT(S), is the minimal cost over all possible 
schedules for the given sequence of S. 

We measure the performance of our algorithms by the competitive ratio. 
For a fixed p > 1, the competitive ratio of a schedule S is defined as C(S) = 
\\L(S)\\ p / OPT(S). Let A be an on-line assignment algorithm. The competitive 
ratio of A for a fixed number m > 1 of machines is defined as 

Ca, m = sup{C(S) | S' is a schedule produced by A on m machines}. 

The competitive ratio of A for an arbitrary number of machines is 

defined as Ca = sup{C, 4 , m | m > 1}. 

The previous definitions cover also the case where we measure the sum of 
squares of loads, since then the cost is (||L(S)|| 2 ) 2 . Consequently, the competitive 
ratios for the sum of the squares of loads are equal to (C(S)) 2 , (CU jm ) 2 and (Ca ) 2 
w.r.t. the £2 norm. 

Now we define the notion of a shape of a schedule, which is an abstraction 
of a schedule where for every machine, all jobs assigned to it except for one 
are replaced by very small jobs with the same total load. In general it may be 
impossible to produce such a schedule by the same algorithm as the original one. 
Nevertheless, the concept of a shape is very useful for proving upper bounds on 
the competitive ratio, since the optimal assignment may improve (by partitioning 
the jobs) while the cost of the assignment does not change. Hence a shape is 
a pessimistic estimate of a schedule. A shape characterizes each machine by 
two numbers, cq is the total load of the small jobs, and u, is (a lower bound 
on) the weight of one large job. We denote a shape by a pair R = ( a,u ), 
where a and u are vectors of m nonnegative reals. The vector of loads of a 




Load Balancing of Temporary Tasks in the l p Norm 



57 



shape is defined as L(R) = a + u. The competitive ratio of a shape R is 
C(R) = \\L(R)\\ p /OPT(R). 

It is possible to compute the optimal cost of the shape R = (a, u) explicitly. It 
is the cost of a schedule in which some big jobs are scheduled each on a separate 
machine and the rest of the jobs are balanced evenly on the rest of the machines. 
Let the machines be ordered so that u, are nondecreasing. For 1 < l < m let 
hi = a i + Ei=i u i)/l- Let k be the largest l such that hi > m ( k is always 

defined, since hi > u±). We define the height of the shape to be h(R) = hk- 

It is easy to see that a good candidate for an optimal schedule for the shape 
R is to put on each machine one job of size exactly u t and partition a,; into a 
few jobs so that they can be balanced exactly on the k machines; then the load 
vector is (hk, . . . ,hk, Wfc+i, . . . , u m ). See the Figures 1 and 2 for examples where 
Oj = 1 for all i. 



Fig. 1. A shape R. 




Fig. 2. Optimal assignment of R. 




Now we extend the notion of shape and define continuous shapes, which 
are defined similarly to shapes. It is an extension, which treats the machines 
as points in the interval [0,m]. The load of machines is a function defined on 
that interval and is the sum of two functions. One function is the total load of 
very small jobs on each machine. The other function is the load of one big job 
on each machine. Formally a continuous shape is a pair R = ( a,u ), where a 
and u are functions (not necessarily continuous), defined in the interval [0, m]. 
The function of loads of a shape is defined as L(R) = a + u. Where a(t) 
represents the total load of very small jobs at point t in the interval [0,m] and 
u(t) represents the load of one big job at point t in the interval [0, m\. From the 
convexity of the function x p it follows that the optimal cost of a continuous 
shape R is obtained by assigning some big jobs each on a separate machine and 
the remaining jobs are balanced evenly on the rest of the machines. Formally 
w.l.o.g let u(t) be a non-decreasing function. There exists i' , s.t u'(t) = u(t) 
and 

n'U\ - I 0 t < t' 

l SWF (JT a + JT u(t)dt) - u(t) t' < t 

and for t\, £2 , s.t t\<if < t%, it holds that u'(t\) > a' fa) + u'(t 2 ) 
and R' = (a! ,u') gives the optimal load L(R') for the shape R. 
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Transition from a shape to a continuous shape is defined as follows. Let 
R = (a,u) be a shape, then its continuous shape R' = ( a',u ') is 

a! (t) = { a,i i — 1 < t < i 
u'(t) = { Uj z — 1 < t < i . 

Note that when moving from a schedule to a shape and from a shape to a 
continuous shape the cost of the assignment does not change while the cost of 
the optimal assignment can only decrease. 

3 The Greedy Algorithm 

In this section we analyze the competitive ratio of the greedy algorithm defined 
below. 

Algorithm Greedy: Upon arrival of a job j assign it to the machine 
with the current minimum load (ties are broken arbitrarily). 

Note that a job departs from a machine it was assigned to. 

To obtain a constant upper bound for the performance ratio of Greedy, we 
show that each schedule can be replaced by a very special continuous shape so 
that the competitive ratio does not decrease. Computing the competitive ratio 
is then shown to be equivalent to computing the maximum of a certain function 
with equality constraints over the reals. A shape R = (h, x ) is called partial 
flat if there exists an integer 1 < k < m, and a real c > 0 such that the following 
conditions hold: 

hi = c for i = 1, . . . , k 

Xi > 0 for i = 1, . . . , k 

hi = Xi = 0 for i = k + 1, . . . , m. 

When k = m the shape is called flat. A shape R = (h, x) is called separate 
if there exists an integer 1 < k < m and a real c > 0, such that the following 
conditions hold: 

hi = 0 for i = 1, ... ,k 
Xi > c for i — 1 , ... ,k 
hi = c for i = k + 1> . . . , m 
Xi = 0 for i = k + 1, . . . , m. 

Let 5 be a schedule obtained by Greedy and let L(S) be the load when 
Greedy reaches the maximum cost. Let h be the load vector of all jobs except 
the last job assigned to each machine, we treat these jobs as very small jobs 
and call them sand jobs. Let x be the weight vector of the last job assigned to 
each machine. The shape R = (h, x) is a shape of the schedule S , we call it the 
Greedy shape. Recall that when moving from a schedule to a shape and from 
a shape to a continuous shape the cost of the assignment does not change while 
the cost of the optimal assignment can only decrease. 
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Lemma 1. Let R = (h,x) be the Greedy shape of a Greedy schedule S, normal- 
ized such that ( OPT(S)) p = m. Then there exists a partial flat shape R' = (h' , x') 
such that ||L(i?)|| p < ||L(i?')|| p and OPT(R) > OPT(R') with the non-zero com- 
ponents of h! equal to 1, and the off-line shape of the shape R ' is a separate 
shape. 



Proof. It is easy to see that hi < 1 (otherwise when the last job was assigned to 
machine i before Greedy reached the maximum cost, ( OPT(S)) p > m, which is 
a contradiction). 

W.l.o.g we assume that the machines are ordered so that hi + Xi are non 
increasing. We perform the following transformation. Note that in each step of 
the transformation the cost of greedy can only increase and the cost of the off- 
line algorithm can only decrease. This is in particular due to the convexity of the 
function x p . Figure 3 shows the obtained shape at the end of each transformation 
step. 



1. In this step we move to a continuous shape R = (h(t),x(t)) , where h{t) and 
x{t) are functions in the interval [0, m]. We treat each point t in that interval 
as a machine and the value h{f) + x(t) as its load. Now we transform regular 
jobs (jobs or part of jobs) that are placed below height 1 to sand jobs. Next 
we push the sand jobs left such that the sand jobs height will be equal to 
1 from point 0 to point Vo, where Vo is the total volume of the sand jobs. 
Formally, let R = (h(t),x(t)) be the current shape and let to be maximal, 
such that h(t) + x(t) > 1 then the new shape R' = x'(t)) is obtained 

as follows. Denote 

pm 

Vo = to + / ( h(t ) + x(t))dt 

Jto 



then 



h\t) 



1 t<V 0 
0 V 0 <t 



X \t) = { k ^ + 1 t ~ t ° 

( 0 to < t 

2. Jobs that in the off-line algorithm are scheduled on machines with sand jobs 
are pushed left to have the same height as the load of those machines in the 
off-line algorithm (which are balanced). Formally, let R = (h(t),x(t)) be the 
current shape then the new shape R' = (h'(t), x'(t)) is obtained as follows. 
Let t [ be a minimal point such that machine t\ has sand jobs in the off-line 
algorithm, let w be its total load in the off-line algorithm and let Vj be the 
volume of the regular jobs on machines Denote 



Vi 




x(t)dt 



h'(t) = h(t) 



then 
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x{t) t <t\ 

0 otherwise 



3. Sand jobs on machines with no regular jobs are transformed continuously 
to regular jobs of height equal to w (as defined in the previous step). We 
put these jobs on the leftmost machines possible that have only sand jobs. 
We perform this process from right to left. This process continues until all 
machines in the greedy shape have sand jobs and a regular job. Formally, let 
R = (h(t),x(t)) be the current shape then the new shape R' = 
is obtained as follows. Let t\ be maximal such that machine t\ has sand jobs 
and a regular job. Let be maximal such that machine t -2 has jobs (any 
jobs) . Let s = , then 



h\t) = 



1 t < s 
0 otherwise 



( x(t ) t < t\ 
x'(t) = < w t\ < t < s 
y 0 otherwise 

It is easy to see that in each of the transformation steps the cost of greedy 
can only increase and the cost of the off-line algorithm can only decrease due to 
the convexity of the function x p . We denote the shape obtained by the trans- 
formation by R' = (h',x r ). This shape has jobs on machines [0, s] for some real 
number 0 < s < m. Each machine t £ [0, s] has sand jobs of height 1 and a 
regular job of height x'(t) > 0, other machines have no jobs assigned to them, 
hence this is a partial flat shape. The off-line shape has jobs of height x'(t) on 
machines t £ [0, s] and sand jobs of total volume s evenly assigned to machines 

(s, ml. In addition min x'(i) > w = s/(m — s), hence the off-line shape is a 
te[o,s] 

separate shape. This completes the proof. 

Lemma 2. Let R = (h, x) be a partial flat shape such that h(t) = 1 for 0 < t < 
s. Assume that the off-line shape of R is a separate shape. Then there is a shape 
R ’=(h,x’), such that for 0 < t < s, x'(t) = y for some value y > 0 and otherwise 
x'{t) = 0 and it holds that ||L(f?)|| p < ||L(i? , )|| p and OPT(R) = OPT(R') . 

Proof. Let 5 • in = s. Define 

-1 n8-m 

y = (— / x p (t)dtf. 
d-mj o 



Clearly OPT(R) = OPT(R'). Now 
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Fig. 3. The transformation steps. 



nS-m 

\\L(R)\\p = { (1 +x(t)ydtf 

JO 

pS-m n5-m 

< ( / 1 dt)~ + ( / x p {t)dt)~ 

Jo Jo 

= (8 • m)~ + (<5 • m) ~ ■ y 
= {8 ■ m ) 1 • (1 + y) 

= IIWIIp 

where the inequality follows from the triangle inequality for the t v norm. This 
completes the proof. 

Define the function f(S : x) with the constraint g(6,x). 

f(5, x ) = s-(i + xy, 

g{S : x) = S-x p + 



= 1 . 



(1) 

(2) 
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Theorem 1. The competitive ratio of Greedy algorithm satisfies 

G Greedy if ( fmax ) 

where f max is the maximum of f with constraint (2), in the domain 0 < x, 0 < 

<5<5- 

Proof. Let Rq be the Greedy shape of a schedule S obtained by Greedy. For 
simplicity we transform to a new shape R± by normalizing all job weights such 
that 

(OPT{R 1 )) p = m. (3) 

If all the h components of R\ are equal to zero then the Greedy schedule is 
optimal. Otherwise by applying Lemma 1 and Lemma 2 we obtain from R\ a 
partial flat shape R 2 = {h, x), in which all the non zero components of x are the 
same and its off-line shape is a separate shape such that ||L(i?i)|| p < ||L(i? 2 )||p 
and OPT{Rf) > OPT(R 2 ). We have 

(\\L(R 2 )\\ p y = S-m-(l + xy, (4) 

pS-m c 

(OPT(R 2 )) p = / x p {t)dt+{l-5)m{ — - r 

Jo I 1 - 0) ■ m 



c „ S p ■ m 


(5) 




6 • m 6 


(6) 


(1 — S) ■ m 1 — 6 



The last inequality restricts the weight of a regular job to be greater then the 
total weight of sand jobs on machines with sand jobs in the off-line shape. For 
simplicity we divide equalities (4) and (5) by m, this does not change the ratio 
between the cost and the off-line cost of shape R 2 . Which gives the following 



x) = S ■ (1 + x) p , 


(7) 


, „ s p 


(8) 


1 A • x ^ 

- + 




(9) 





The left side of the equality is f{S,x) by definition. The first inequality results 
from (5), since ( OPT(R 2 )) p < (OPT(Ri)) p < m and the division by m. 
Substituting (9) in (8) gives 

(5P+ 1 { S p _ S p 

1 - (i - s)p + (i - sy- 1 ~ (1 - S) p 

which yields S < We obtain the following relation for / 

f(s r] = ^\\m2)\\ P ) p ( 11 ^) 11 ^ ms)\\ P y 

JK ’ ’ h-m ~ (OPTiR^P ~ (OPT{S)) p 
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where the first inequality follows from (3) and the fact that \\L(Ri) || p < ||L(i? 2 ) || P - 
Hence to bound the competitive ratio of Greedy we need to solve the following 
maximum problem in the domain 0 < S < \ and 0 < x. We need to find the 
maximum of f(5, x) under the constraint (8). It is easy to see that the maximum 
of / is obtained when (8) is an equality. Hence we obtain the following maximum 
problem for / with constraint g (1), (2) in the domain 0 < x,0 < 6 < This 
completes the proof. 

The following theorem results from Theorem 1. 

Theorem 2. For £2 norm the competitive ratio of Greedy algorithm is C Greedy < 

1.493. 

Proof. Omitted. 

The upper bound can be improved for 3 machines. 

Theorem 3. For £ 2 norm and m = 3 the competitive ratio of Greedy algorithm, 
C Greedy , 3 — 1-453. 

Proof. Omitted. 

Now we turn to the case of the general £ v norm. 

Theorem 4. For any p > 1 it holds C Greedy < 2 — fi . 

Proof. Omitted. 

Theorem 5. For m = 2 the greedy algorithm is optimal and its competitive 
ratio is 

r< _ ^l + (l + a;) p ^ /irA 

C Greedy, 2 - SUp ^ + ^ j , (10) 

and the supremum is achieved as a maximum at the unique solution x € (0,oo) 
of the equation 

x p - 1 {l + {l + x) 1 - p ) = 2 p . 



Proof. Omitted. 

From the above theorem, which claims that for m — 2 Greedy is optimal 
and its competitive ratio is the same as in the permanent tasks case, we obtain 
according to [3] that for the £2 norm and for two machines Greedy is optimal 

and its competitive ratio is J (\/^ + 3)/4 « 1.145. 
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4 Lower Bounds 

4.1 Lower Bounds for £ 2 Norm 

In this section we give lower bounds for the £2 norm. We prove a lower bound 
for any algorithm (the proof is for m = 3) . Then we prove a weaker lower bound 
for any m > 3. Finally we prove a lower bound for Greedy for a large number of 
machines (m — > 00 ). 

Theorem 6. For any on-line assignment algorithm A, Ca > Ca , 3 > 1.202. 

Proof. Consider the following sequence for three machines. First three unit jobs 
and one job of weight x > 1 arrive. Then two unit jobs depart. At last one 
job of weight 2 arrive. Consider the first three unit jobs. If algorithm A assigns 
two or more jobs to the same machine, it does not get any other job. Its cost 
is at least 5, the optimal cost is 3, and we are done. Otherwise, algorithm A 
assigns one unit job to every machine (the off-line algorithm assigns two unit 
jobs to machine 1 and one unit job to machine 2). Now the next job of weight 
x arrives. Algorithm A assigns it to one of the machines say 1 (the off-line 
algorithm assigns it to machine 3). Then two unit jobs depart, which are the 
jobs on machines 2,3 (the jobs on machine 1 in the off-line algorithm). At last 
a job of weight 2 arrive. The best algorithm A can do is to assign it to one of 
the empty machines 2 or 3 (the off-line algorithm assigns it to machine 1). Its 
cost is at least \/{l + x ) 2 + 2 2 , whereas the optimum cost is a/2 2 + 1 + x 2 . The 
maximal ratio ss 1.202 is achieved for x = a/5- 

Theorem 7. For any number of machines m > 3 and any on-line assignment 
algorithm A, Ca , m > 2/ a/3 — 0(A). For m divisible by 3, CA,m > 2/a/3. 

Proof. Let m = 3 k. We consider the following sequence. First 4 k unit jobs arrive. 
Then 2 k unit jobs depart. Finally k jobs of weight 2 arrive. Consider the arrival 
of the first 4 k unit jobs. Algorithm A assigns these jobs. W.l.o.g we assume 
that machines 1 are sorted in nondecreasing order of load (the off-line 

algorithm assigns two jobs on each machine 1 , ... ,k and one job on each machine 
k + 1, . . . ,3k). Then 2k jobs depart. There exists a minimal t. > 2k, such that 
machines 1 are assigned at least 2k jobs. Then 2k jobs from machines 

1 , ... ,t depart as follows, all jobs from machines 1 , . . . , t — 1 and some jobs from 
machine t (in the off-line algorithm the jobs on machines 1 , . . . , k depart). At the 
end of this step machines 1, ... ,2k are empty. Next k jobs of weight 2 arrive. The 
best algorithm A can do is to assign each job to an empty machine (In the off- 
line these jobs are assigned to machines 1 ,... ,k). Finally there are jobs of total 
weight 4 k assigned to no more than 2k machines. Due to the convexity of the 
function x p , the minimum cost is obtained when all machines have the same load, 
therefore its cost is at least \j2k ■ (2) 2 . The optimum cost is y/k • 2 2 + 2k ■ l 2 , 
which yields a ratio of 2/ a/3. For m not divisible by 3 a similar proof gives a 
ratio of 2/a/3 — 0(A). 

Theorem 8. For the greedy algorithm, C Greedy > 1.338. 
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Proof. First we prove a weaker lower bound of 1.314 for the greedy algorithm, 
by solving an ordinary differential equation analytically. Then by a similar proof 
we obtain a more complex ordinary differential equation, which has no simple 
solution. Hence we use a computer program to compute the competitive ratio in 
this case, which gives a lower bound of 1.338. 

We start with the first proof. We see the m machines as to points in the 
interval (0, 1], machine i as the point — £ (0, 1], and the load of the machines as 
a function /(t), f(t) = U for < t < — , where U is the load of machine i. For 
each machine i the total load is the value of f in the interval ( — — 1 and the 
total load of all machines is the total volume of / in the interval (0, 1] multiplied 
by to. Let f(k/m) be the load of machine k at the end of step k and let F{k/m) 
be the volume of the jobs assigned to machines k, ... ,m at the beginning of step 
k in the following process. For convenience we number the steps from to. to 1 in 
decreasing order. In this process we keep the volume of jobs fixed and equal to 1 
at the end of each step. We start with the arrival of infinitesimally small jobs of 
total volume 1, we call jobs of this type sand jobs. Both the off-line and greedy 
algorithms assign these jobs evenly on all the machines (total height 1 on each 
machine). At step k a job of height x (x > 1) arrives. Greedy assigns this job 
to the machine with minimum load w.l.o.g to machine k which is the one with 
the largest index among all machines with the minimum load 1 , ,k (otherwise 
we swap indices) and the off-line algorithm performs the same assignment. Then 
the sand jobs on machines 1, . . . , k — 1 depart in greedy. In the off-line algorithm 
the departing jobs are composed of all the sand jobs of machine k and equal 
amounts of sand jobs from machines 1 , . . . , k — 1 with the appropriate volume. 
Next sand jobs arrive with total volume 1 — F(k/m) — — (=l-total volume of 
machines k,...,m), thus keeping the total volume equal to 1. Greedy and the 
off-line algorithms assign the sand jobs to machines 1 , . . . , k— 1 evenly, such that 
these machines will have the same load. At the end of step k 



f (k/m) 



1 — F(k/m ) 
k/m 



The computation of the lower bound for Greedy according to the above 
scenario, which is technical, is omitted. 



4.2 Lower Bound for General p > 1 

In this section we construct a lower bound for general p > 1. 

Theorem 9. For any p > 1 and any on-line algorithm A it holds that Ca > 

2 -Off). 

Proof. Omitted. 
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Abstract. We address an important communication issue in wireless 
cellular networks that utilize Frequency Division Multiplexing (FDM) 
technology. In such networks, many users within the same geographical 
region (cell) can communicate simultaneously with other users of the 
network using distinct frequencies. The spectrum of the available fre- 
quencies is limited; thus, efficient solutions to the call control problem 
are essential. The objective of the call control problem is, given a spec- 
trum of available frequencies and users that wish to communicate, to 
maximize the number of users that communicate without signal inter- 
ference. We consider cellular networks of reuse distance k > 2 and we 
study the on-line version of the problem using competitive analysis. 

In cellular networks of reuse distance 2, the previously best known al- 
gorithm that beats the lower bound of 3 on the competitiveness of de- 
terministic algorithms works on networks with one frequency, achieves 
a competitive ratio against oblivious adversaries which is between 2.469 
and 2.651, and uses a number of random bits at least proportional to the 
size of the network. We significantly improve this result by presenting 
a series of simple randomized algorithms that have competitive ratios 
smaller than 3, work on networks with arbitrarily many frequencies, and 
use only a constant number of random bits or a comparable weak random 
source. The best competitiveness upper bound we obtain is 7/3. 

In cellular networks of reuse distance fc > 2, we present simple random- 
ized on-line call control algorithms with competitive ratios which signif- 
icantly beat the lower bounds on the competitiveness of deterministic 
ones and use only O(logfc) random bits. Furthermore, we show a new 
lower bound on the competitiveness of on-line call control algorithms in 
cellular networks of reuse distance k > 5. 



1 Introduction 

In this paper we study frequency spectrum management issues in wireless net- 
works. Due to the rapid growth of wireless communications and the limitations 
and cost of the frequency spectrum, such issues are very important nowadays. 

* This work was partially funded by the European Union under 1ST FET Project 
ALCOM-FT, 1ST FET Project CRESCCO and RTN Project ARACNE. 
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We consider wireless networks in which base stations are used to build the 
required infrastructure. In such systems, the architectural approach used is the 
following. A geographical area in which communication takes place is divided 
into regions. Each region is the calling area of a base station. Base stations 
are connected via a high speed network. When a user A wishes to communicate 
with some other user B, a path must be established between the base stations 
of the regions in which the users A and B are located. Then communication is 
performed in three steps: (a) wireless communication between A and its base 
station, (b) communication between the base stations, and (c) wireless commu- 
nication between B and its base station. Thus, the transmission of a message 
from A to B first takes place between A and its base station, the base station of 
A sends the message to the base station of B which will transmit it to B. At least 
one base station is involved in the communication even if both users are located 
in the same region or only one of the two users is part of the wireless network 
(and the other uses for example the PSTN). Improving the access of users to base 
stations is the aim of this work. 

The network topology usually adopted [6, 7] is the one shown in the left 
part of Figure 1. All regions are regular hexagons (cells) of the same size. This 
shape results from the uniform distribution of identical base stations within the 
network, as well as from the fact that the calling area of a base station is a circle 
which, for simplicity reasons, is idealized as a regular hexagon. Due to the shape 
of the regions, we call these networks cellular wireless networks. 

Many users of the same region can communicate simultaneously with their 
base station. This can be achieved via frequency division multiplexing (FDM). 
The base station is responsible for allocating distinct frequencies from the avail- 
able spectrum to users so that signal interference is avoided. Since the spectrum 
of available frequencies is limited, important engineering problems related to 
the efficient reuse of frequencies arise. Signal interference usually manifests itself 
when the same frequency is assigned to users located in the same or adjacent 
cells. Alternatively, in this case, we may say that the cellular network has reuse 
distance 2. By generalizing this parameter, we obtain cellular networks of reuse 
distance k in which signal interference between users that have been assigned 
the same frequency is avoided only if the users are located in cells with distance 
at least k. 

Signal interference in cellular networks can be represented by an interference 
graph G. Vertices of the graph correspond to cells and an edge (it, v) in the 
graph indicates that the assignment of the same frequency to two users lying 
at the cells corresponding to nodes u and v will cause signal interference. The 
interference graph of a cellular network of reuse distance 2 is depicted in the 
right part of Figure 1. If the assumption of uniform distribution of identical base 
stations does not hold, arbitrary interference graphs can be used to model the 
underlying network. 

In this paper we study the call control problem which is defined as follows: 
Given users that wish to communicate with their base station, the call control 
problem on a network that supports a spectrum of w available frequencies is to 
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Fig. 1 . A cellular network and the interference graph if the reuse distance is 2. 



assign frequencies to users so that at most w frequencies are used in total, signal 
interference is avoided, and the number of users served is maximized. 

We assume that calls corresponding to users that wish to communicate ap- 
pear in the cells of the network in an on-line manner. When a call arrives, a 
call control algorithm decides either to accept the call (assigning a frequency to 
it), or to reject it. Once a call is accepted, it cannot be rejected (preempted). 
Furthermore, the frequency assigned to the call cannot be changed in the future. 
We assume that all calls have infinite duration; this assumption is equivalent to 
considering calls of the same duration. 

Competitive analysis [11] has been used for evaluating the performance of on- 
line algorithms for various problems. In our setting, given a sequence of calls, 
the performance of an on-line algorithm A is compared to the performance of 
the optimal algorithm OPT. 

Let B(a) be the benefit of the on-line algorithm A on the sequence of calls 
er, i.e. the set of calls of a accepted by A and 0(a) the benefit of the optimal 
algorithm. If A is a deterministic algorithm, we define its competitive ratio (or 
competitiveness) as max, , where the maximum is taken over all possible 
sequences of calls. If A is a randomized algorithm, we define its competitive 
ratio as max, gljg^jn > where £’[|I3 ((t)|] is the expectation of the number of calls 
accepted by A, and the maximum is taken over all possible sequences of calls. 

Usually, we compare the performance of deterministic algorithms against 
off-line adversaries , i.e. adversaries that have knowledge of the behaviour of 
the deterministic algorithm in advance. In the case of randomized algorithms, 
we consider oblivious adversaries whose knowledge is limited to the probability 
distribution of the random choices of the randomized algorithm. 

The static version of the call control problem is very similar to the famous 
maximum independent set problem. The on-line version of the problem is studied 
in [1-4,8,10]. [1], [2], and [8] study the call control problem in the context of 
optical networks. Pantziou et al. [10] present upper bounds for networks with 
planar and arbitrary interference graphs. Usually, competitive analysis of call 
control focuses on networks supporting one frequency. Awerbuch et al. [1] present 
a simple way to transform algorithms designed for one frequency to algorithms 
for arbitrarily many frequencies with a small sacrifice in competitiveness (see 
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also [5,12]). Lower bounds for call control in arbitrary networks are presented 
in [3]. 

The greedy algorithm is probably the simplest on-line algorithm. When a 
call arrives, the greedy algorithm seeks for the first available frequency. If such 
a frequency exists, the algorithm accepts the call assigning this frequency to it, 
otherwise, the call is rejected. In general, Pantziou et al. [10] show that this 
algorithm has competitive ratio equal to the degree of the interference graph 
and no better in general. The greedy algorithm is optimal within the class of 
deterministic on-line call control algorithms. 

Simple randomized algorithms can be defined using the “classify and ran- 
domly select” paradigm [1,2,10]. Such algorithms use a coloring of the under- 
lying interference graph, randomly select a color out of the colors used, and 
execute the greedy algorithm in the cells colored with the selected color, ignor- 
ing (i.e., rejecting) calls in all other cells. The competitive ratio achieved in this 
way, against oblivious adversaries, is equal to the number of colors used in the 
coloring of the interference graph. 

In cellular networks of reuse distance 2, the greedy algorithm is 3-competitive 
against off-line adversaries, in the case of one frequency. Slightly worse compet- 
itiveness bounds can be proved in the case of arbitrarily many frequencies using 
the techniques of [1,5,12]. In [4], using similar arguments with those of [10], it 
was observed that no deterministic on-line call control algorithm in cellular net- 
works of reuse distance 2 can be better than 3-competitive against off-line adver- 
saries. Applying the “classify and randomly select” paradigm using a 3-coloring 
of the interference graph, we obtain a 3-competitive randomized algorithm even 
in the case of arbitrarily many frequencies. Observe that this algorithm uses a 
very weak random source which equiprobably selects one out of three distinct 
objects. 

The authors in [4] describe algorithm p-R.ANDOM, an intuitive on-line ran- 
domized call control algorithm for networks that support one frequency. They 
present upper and lower bounds on the competitive ratio of the algorithm as 
functions of parameter p and, by optimizing these functions, they prove that, for 
some value of p, the competitive ratio of algorithm p- Random against oblivious 
adversaries is between 2.469 an 2.651. The analysis of algorithm p-RANDOM in 
[4] applies only to cellular networks with one frequency but it indicates that ran- 
domization helps to beat the deterministic upper bounds. However, the number 
of random bits used by the algorithm may be proportional to the size of the net- 
work. The best known lower bound on the competitive ratio of any randomized 
call control algorithm in cellular networks of reuse distance 2 is 1.857 [4], 

Results on on-line call control in cellular networks of reuse distance k > 2 
are only implicit in previous work. The greedy algorithm has competitive ratio 4 
and 5 in cellular networks of reuse distance k € {3, 4, 5} and k > 6, respectively, 
which support one frequency. This is due to the fact that the acceptance of a 
non-optimal call may cause the rejection of at most 4 and 5 optimal calls, respec- 
tively. These competitive ratios are the best possible that can be achieved by 
deterministic algorithms. Using the techniques of [1, 5, 12], it can be shown that, 
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in the case of arbitrarily many frequencies, the greedy algorithm has competi- 
tive ratio at most 4.521 and at most 5.517 in cellular networks of reuse distance 
k £ {3, 4, 5} and k > 6, respectively. 

Furthermore, applying the “classify and randomly select” paradigm using an 
efficient coloring of the interference graph of cellular networks of distance reuse 
k > 2 would give randomized on-line algorithms with competitive ratio fHk 2 ). 
Even in the case of k = 3, the competitive ratio we obtain in this way is 7. 

In this paper, we improve previous results on the competitiveness of on- 
line call control algorithms in cellular networks. We present algorithms based 
on the “classify and randomly select” paradigm which use new colorings of the 
interference graph. These algorithms use a small number of random bits, and 
have small competitive ratios against oblivious adversaries even in the case of 
arbitrarily many frequencies. In particular, in cellular networks of reuse distance 

2, we significantly improve the best known competitiveness bounds achieved by 
algorithm p-R.ANDOM by presenting a series of simple randomized algorithms 
that have smaller competitive ratios, work on networks with arbitrarily many 
frequencies, and use only a constant number of random bits or a comparable 
weak random source. The best competitiveness upper bound we obtain is 7/3. 
In cellular networks of reuse distance k > 2, we present simple randomized 
on-line call control algorithms with competitive ratios which significantly beat 
the lower bounds on the competitiveness of deterministic algorithms and use 
only 0(log k) random bits. For any k > 2, the competitive ratio we achieve is 
strictly smaller than 4. Furthermore, we show a new lower bound of 25/12 on the 
competitiveness, against oblivious adversaries, of on-line call control algorithms 
in cellular networks of reuse distance k > 5. 

The rest of the paper is structured as follows. We present a simple 8/3- 
competitive randomized algorithm in Section 2. Simpler algorithms with better 
competitive ratios (including the algorithm with competitive ratio 7 /3) are pre- 
sented in Section 3. The upper bounds for cellular networks of reuse distance 
k > 2 are presented in Section 4. A Our new lower bound for cellular networks 
of reuse distance k > 5 can be found in Section 5. 

2 A Simple 8/3-Competitive Algorithm 

In this section we present algorithm CRS-A, a simple randomized on-line al- 
gorithm for call control in cellular networks of reuse distance 2. The algorithm 
works in networks with one frequency and achieves a competitive ratio against 
oblivious adversaries which is similar (but slightly inferior) to that which has 
been proved for algorithm p-R.ANDOM. 

Algorithm CRS-A uses a coloring of the cells with four colors 0, 1, 2, and 

3, such that only two colors are used in the cells belonging to the same axis. 
This can be done by coloring the cells in the same x-axis with either the colors 
0 and 1 or the colors 2 and 3, coloring the cells in the same y-axis with either 
the colors 0 and 2 or the colors 1 and 3, and coloring the cells in the same z-axis 
with either the colors 0 and 3 or the colors 1 and 2. Such a coloring is depicted 
in the left part of Figure 2. 
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Algorithm CRS-A randomly selects one out of the four colors and executes 
the greedy algorithm on the cells colored with the other three colors, ignoring 
(i.e., rejecting) all calls in cells colored with the selected color. 





Fig. 2. The 4-coloring used by algorithm CRS-A and the corresponding subgraph of 
the interference graph induced by the nodes not colored with color 3. 



Theorem 1. Algorithm CRS-A in cellular networks of reuse distance 2 support- 
ing one frequency is 8/ 3- competitive against oblivious adversaries. 

Proof. Let cr be a sequence of calls and denote by O the set of calls accepted by 
the optimal algorithm. Denote by a' the set of calls in cells which are not colored 
with the color selected and by O' the set of calls the optimal algorithm would 
have accepted on input a' . Clearly, \0'\ will be at least as large as the subset of 
O which belongs to a'. Since the probability that the cell of a call in O is not 
colored with the color selected is 3/4, it is £[|0'|] > ||0|. Now let B be the set of 
calls accepted by algorithm CRS-A, i.e., the set of calls accepted by the greedy 
algorithm when executed on sequence a' . Observe that each call in O' either 
belongs in B or it is rejected because some other call is accepted. Furthermore, 
a call in B\0' can cause the rejection of at most two calls of O'. This implies 
that \B\ > | O' |/2 which yields that the competitive ratio of algorithm CRS-A is 

\Q\ < 2 | 0 | 8 

£[\B\] - £[\0’\] ~ 3’ 

□ 

The main advantage of algorithm CRS-A is that it uses only two random bits. 
In the next section we present simple on-line algorithms with improved competi- 
tive ratios that use slightly stronger random sources and work on networks with 
arbitrarily many frequencies. 

3 Improved Algorithms 

Algorithm CRS-A can be seen as an algorithm based on the “classify and ran- 
domly select” paradigm. It uses a coloring of the interference graph (not neces- 
sarily using the minimum possible number of colors) and a classification of the 
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colors. It starts by randomly selecting a color class (i.e., a set of colors) and 
then run the greedy algorithm in the cells colored with colors from this color 
class, ignoring (i.e., rejecting) calls in cells colored with colors not belonging to 
this class. Algorithm CRS-A uses a coloring of the interference graph with four 
colors 0, 1, 2, and 3, and the four color classes {0, 1, 2}, {0, 1, 3}, {0, 2, 3}, and 
{1,2,3}. Note that, in the previously known algorithms based on the “classify 
and randomly select” paradigm, color classes are singletons (e.g., [1], [10]). 

The following simple lemma gives a sufficient condition for obtaining efficient 
on-line algorithms based on the “classify and randomly select” paradigm. 

Lemma 1. Consider a network with interference graph G = (V,E) which sup- 
ports w frequencies and let x be a coloring of the nodes of V with the colors of 
a set X. If there exist v sets of colors sq, s i, ..., s v - \ C X and an integer X such 
that 



— each color of X belongs to at least X different sets of the sets sq, si, ..., s u - %, 
and 

— for i = 0, 1, ..., v — 1, each connected component of the subgraph of G induced 
by the nodes colored with colors in Si is a clique, 

then there exists an on-line randomized call control algorithm for the network G 
which has competitive ratio v/X against oblivious adversaries. 

Proof. Consider a network with interference graph G which supports w frequen- 
cies and the randomized on-line algorithm working as follows. The algorithm 
randomly selects one out of the v color classes So, •••, and executes the 
greedy algorithm on the cells colored with colors of the selected class, rejecting 
all calls in cells colored with colors not in the selected class. 

Let a be a sequence of calls and let O be the set of calls accepted by the 
optimal algorithm on input a. Assume that the algorithm selects the color class 
Si . Let a' be the sequence of calls in cells colored with colors in Si and O' be 
the set of calls accepted by the optimal algorithm on input er'. Also, we denote 
by B the set of calls accepted by the algorithm. First we can easily show that 
|.B| = |0'|. Let Gj be a connected component of the subgraph of G induced by 
the nodes of G colored with colors in Sj. Let a j be the subsequence of a' in cells 
corresponding to nodes of Gj. Clearly, any algorithm (including the optimal one) 
will accept at most one call of aj at each frequency. If the optimal algorithm 
accepts w calls, this means that the sequence aj has at least w calls and the 
greedy algorithm, when executed on a 1 , will accept w calls from <jj (one call in 
each one of the available frequencies). If the optimal algorithm accepts w' < w 
calls from aj, this means that <jj contains exactly w' < w calls and the greedy 
algorithm will accept them all in w' different frequencies. Since a call of aj is 
not constrained by a call in aj' for j j' , we obtain that \B\ = |0'|. 

The proof is completed by observing that the expected benefit of the optimal 
algorithm on input a' over all possible sequences a' defined by the random selec- 
tion of the algorithm is £[|0'|] > f\0\, since, for each call in O, the probability 
that the color of its cell belongs to the color class selected is at least v/X. Hence, 
the competitive ratio of the algorithm against oblivious adversaries is 
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£[\B\] S[\0'\] ~ X,V - 

□ 

Next, we present two randomized on-line algorithms for call control in cellular 
networks of reuse distance 2, namely CRS-B and CRS-C, which are also based on 
the “classify and randomly select” paradigm and achieve even better competitive 
ratios. 

Consider a coloring of the cells with five colors 0, 1, 2, 3, and 4 such that for 
each i e {0, 1, 2, 3, 4}, and for each cell colored with color i, the two adjacent cells 
in the same x-axis are colored with colors (i — 1) mod 5 and ( i + 1) mod 5, while 
the remaining four of its adjacent cells are colored with colors (i + 2) mod 5 and 
(i + 3) mod 5. Such a coloring is depicted in the left part of Figure 3. Also, define 
Si = {i, (i + 1) mod 5}, for i = 0,1,..., 4. Observe that, for each i = 0,1,. ..,4, 
each pair of adjacent cells colored with the colors i and (i + 1) mod 5 is adjacent 
to cells colored with colors (i + 2) mod 5, ( i + 3) mod 5, and (i + 4) mod 5, i.e., 
colors not belonging to s^. Thus, the coloring x together with the color classes 
Si satisfy the conditions of Lemma 1 with v = 5 and A = 2. We call CRS-B 
the algorithm that uses this coloring and works according to the “classify and 
randomly select” paradigm as in the proof of Lemma 1. We obtain the following. 





Fig. 3. The 5-coloring used by algorithm CRS-B and the 7-coloring used by algorithm 
CRS-C. The skewed cells are those colored with the colors in set so- 



Theorem 2. Algorithm CRS-B in cellular networks with reuse distance 2 is 
5/2- competitive against oblivious adversaries. 

Now consider a coloring of the cells with seven colors 0, 1, ..., 6 such that for 
each for each cell colored with color i (for i = 0, . . . , 6) , its two adjacent cells in 
the same x-axis are colored with the colors (z — 1) mod 7 and (i+1) mod 7, while 
its two adjacent cells in the same z-axis are colored with colors (i — 3) mod 7 and 
(i+3) mod 7. Such a coloring is depicted in the right part of Figure 3. Also, define 
Si = {i, (i + 1) mod 7, (i + 3) mod 7}, for i = 0,1, ..., 6. Observe that, for each 
i = 0, 1, ..., 6, each triangle of cells colored with the colors i, ( i + 1) mod 7, and 
(i + 3) mod 7 is adjacent to cells colored with colors (i + 2) mod 7, (i + 4) mod 7, 
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(i + 5) mod 7, and (i + 6) mod 7, i.e., colors not belonging to s,. Thus, the 
coloring y together with the color classes Si satisfy the conditions of Lemma 1 
with v = 7 and A = 3. We call CRS-C the algorithm that uses this coloring and 
works according to the “classify and randomly select” paradigm as in the proof 
of Lemma 1. We obtain the following. 

Theorem 3. Algorithm CRS-C in cellular networks with reuse distance 2 is 
7 /3 - competitive against oblivious adversaries. 

The two algorithms above (CRS-B and CRS-C) make use of a random source 
which equiprobably selects one out of an odd number of distinct objects. If 
we only have a number of fair coins (random bits) available, we can design 
algorithms with small competitive ratios by combining the algorithms above. For 
example, using 6 random bits, we may construct the following algorithm. We use 
integers 0, 1, ..., 63 to identify each of the 63 outcomes of the 6 random bits. For an 
outcome i G {0, ..., 49}, the algorithm executes algorithm CRS-B using color class 
s* mod 5 , and for an outcome i £ {50,..., 63}, the algorithm executes algorithm 
CRS-C using color class S(j_ 5 0 ) mo d 7 - It can be easily seen that this algorithm has 
competitive ratio 32/13 ss 2.462 against oblivious adversaries, since its expected 
benefit is at least 50/64 • 2/5 + 14/64 • 3/7 = 13/32 times the optimal benefit. 
Similarly, using 8 random bits, we obtain an on-line algorithm with competitive 
ratio 64/27 ss 2.37. We can generalize this idea, and, for sufficiently small e > 0, 
we can construct an algorithm which uses t = 0(log 1/e) random bits, and, on 
2* mod 7 of the 2* outcomes, it does nothing, while the rest of the outcomes are 
assigned to executions of algorithm CRS-C. In this way, we obtain the following. 

Corollary 1. For any e > 0, there exists an on-line randomized call-control 
algorithm for cellular networks with reuse distance 2 that uses 0 { log 1/e) random 
bits and has competitive ratio at most 7/3 + e against oblivious adversaries. 

4 Cellular Networks with Reuse Distance k > 2 

For cellular networks with reuse distance k > 2, we present algorithm CRS-A; 
which is based on the “classify and randomly select” paradigm. Algorithm CRS- 
k uses the following coloring of the interference graph of a cellular network with 
reuse distance k. Cells are colored with the colors 0,1,..., 3 A; 2 — 3 k such that 
for any cell colored with color i, its adjacent cells in the x-axis are colored with 
colors (i— 1) mod (3fc 2 — 3k+l) and (i+1) mod (3k 2 — 3k+l), while its adjacent 
cells in the z-axis are colored with colors (i — 3 (k — l) 2 ) mod (3 k 2 — 3 k + 1) and 
(i + 3 (k — l) 2 ) mod (3 k 2 — 3 k + 1). 

For odd k, for i = 0, 1, ...,3k 2 — 3k, the color class s, contains the following 
colors. For j = 0, 1, ..., it contains the colors (i + 3j(k — l) 2 — j) mod (3k 2 — 
3k + 1), ..., (i + 3j(k — l) 2 + ^1) mod (3k 2 — 3k + 1), and for j = ..., k — 1, 

it contains the colors (i+ 3 ^ fc ~ 1) +3(j— > Cf^)(k— l) 2 — ^^) mod (3k 2 — 3k + l), 
..., (i + 3( ' k ~ 1 ' 1 + 3 (j — ^=T)(fc — l) 2 + k — 1 — j) mod (3k 2 — 3k + 1). 
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For even k, for * = 0,1, 3 k 2 — 3k, the color class s* contains the following 
colors. For j = 0, 1, | — 1, it contains the colors (i + 3j(k — l) 2 —j) mod (3k 2 — 
3A;+1), (i+3j(k— 1) 2 +|) mod (3k 2 — 3fc+l), and for j = |, k—1, it contains 
the colors (* + 3(| — l)(k — l) 2 + 3(j — | + l)(fc — l) 2 — | + 1) mod (3k 2 — 3k + l), 
..., (i + 3(| — 1 )(k — l) 2 + 3 (j — | + 1 )(k — l) 2 + k — 1 — j) mod (3k 2 — 3k + 1). 

Note that, for k = 2, we obtain the coloring used by algorithm CRS-C. 
Examples of the coloring for k = 3 and fc = 4 as well as the cells colored with 
colors from the color class so are depicted in Figure 4. 

We can show the following two lemmas. Formal proofs will be given in the 
final version of the paper. 

Lemma 2. Let k > 2 and G be the interference graph of a cellular network of 
distance reuse k. Consider the coloring of G used by algorithm CRS-k and the 
color classes Si, for i = 0, 1, ...,3k 2 — 3k. For any color j such that 0 < j < 
3 k 2 — 3k, the number of different color classes Sj that color j belongs to is if 
k is even, and 3fc 4 +1 if k is odd. 

Lemma 3. Let k > 2 and G be the interference graph of a cellular network of 
distance reuse k. Consider the coloring of G used by algorithm CRS-k and the 
color classes Si, for i = 0, 1, ..., 3fc 2 — 3k. For i = 0, 1, ..., 3k 2 —3k, each connected 
component of the subgraph of G induced by the nodes of G colored with colors in 
Si is a clique. 

Thus, the colorings and the color classes described satisfy the condition of Lemma 
1 with A = 3 k 2 — 3k + 1 and v = 3 4 if k is even, and v = 3fc 4 +1 if k is odd. By 
Lemma 1, we obtain the following. 

Theorem 4. The competitive ratio against oblivious adversaries of algorithm 
CRS-k in cellular networks with reuse distance k > 2 is 4 (l — 3 4fc2 1 ) if k is 

even, and 4 ^1 — 3k 3 2 k +1 J if k is odd. 

Algorithm CRS-fc in cellular networks of reuse distance k uses a random 
source which equiprobably selects one among 3k 2 — 3k + 1 distinct objects. By 
applying ideas we used in the previous section, we can achieve similar competi- 
tiveness bounds by algorithms that use random bits. 

Corollary 2. For any e > 0, there exists an on-line randomized call-control 
algorithm for cellular networks of reuse distance k, that uses 0(log 1/e + log A:) 
random bits and has competitive ratio at most 4 (l — /fpr) + e if k is even, and 

4 (l — 3 f! 2 k +1 ^ + e if k is odd, against oblivious adversaries. 

5 Lower Bounds 

In previous work ([4], implicitly in [10]), it has been observed that no determin- 
istic on-line call control algorithm can have a competitive ratio better than 3 
against off-line adversaries. We can easily extend this lower bound and obtain 
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Fig. 4. Examples of the colorings used by the algorithms CRS-3 and CRS-4. The 
skewed cells are those colored with the colors in set sq . 



lower bounds of 4 and 5 on the competitiveness of deterministic on-line call con- 
trol algorithms in cellular networks of reuse distance k £ {3, 4, 5} and k > 6, 
respectively. 

Hence, the randomized algorithms presented in the previous section signifi- 
cantly beat the lower bound on the competitiveness of deterministic algorithms. 
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In what follows, using the Minimax Principle [13] (see also [9]), we prove a lower 
bound on the competitive ratio, against oblivious adversaries, of any random- 
ized algorithm in cellular networks with reuse distance k > 5. Again, we consider 
networks that support one frequency; our lower bound can be trivially extended 
to networks that support multiple frequencies. In our proof, we use the following 
lemma. 

Lemma 4 (Minimax Principle [9]). Given a probability distribution V over 
sequences of calls a, denote by £-p[B^(a)] and £p[Bqpt (cr)] the expected benefit 
of a deterministic algorithm A and the optimal off-line algorithm on sequences 
of calls generated according to V. Define the competitiveness of A under V, 
to be such that 

■p _ £v[Bopt{v)} 

Ca ~ £v[B A (a)} ■ 

Let A n be a randomized algorithm. Then, the competitiveness of A under V is a 
lower bound on the competitive ratio of Ar against an oblivious adversary, i.e. 
C V A <C A ■ 

Our lower bound is the following. 

Theorem 5. No randomized call-control algorithm in cellular networks with 
distance reuse k > 5 can be better than 25/ 12 -competitive against an oblivious 
adversary. 

Proof. Consider a cellular network with reuse distance 5 and ten cells Vo, v A .i, 
va,2, V B ,1, v b ,2, v b ,3, Vc,i, v C,2, v c ,3, and v c ,A as shown in Figure 5. We will 
prove that there exists an adversary AW that produces calls in these cells 
according to a probability distribution V in such way that no deterministic 
algorithm can be better than 25/12-competitive under V even if it knows the 
probability distribution V in advance. 




Fig. 5. The cellular network of reuse distance 5 used in the proof of Theorem 9 and 
the subgraph of the interference graph induced by the nodes corresponding to the ten 
cells vo, va,i, v A ,2, VBA, v b, 2 , v B} 3, vc, i, vc, 2 , vc, 3, and VC, 4- 
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We define the probability distribution V as follows. First, the adversary pro- 
duces a call in the cell vq. Then, it 

— either stops, with probability 1 / 2 , 

— or does the following, with probability 1 / 2 . It presents two calls, one in the 
cell va , i and one in the cell va, 2, and 

• either stops, with probability 1 / 3 , 

• or does the following with probability 2 / 3 . It presents three calls, one in 
the cell vb,1i one i n the cell vb, 2, and one in the cell vb, 3, and 

* either stops, with probability 1 / 4 , 

* or does the following, with probability 3 / 4 . It presents four calls, one 
in the cell vc, 1, one in the cell vc, 2, one in the cell vc, 3, and one in 
the cell vc, 4, and then stops. 

Clearly, the benefit of the optimal off-line algorithm is the number of calls 
presented by the adversary in the last step before stopping the sequence. Thus, 
the expected benefit of the optimal off-line algorithm on sequences of calls gen- 
erated according to V is 






25 

12 



Let A be a deterministic call control algorithm that runs on the calls produced 
by AW. First, we observe that no algorithm could gain by accepting one of the 
two calls presented in cells va. 1 and va,2 or by accepting one or two of the three 
calls presented in cells vb, 1, vb , 2 and vb , 3 • Hence, we may assume that the 
algorithm either accepts all calls presented at a step or rejects them all. 

Consider t, executions of the algorithm on t sequences produced according 
to the probability distribution V. Let qo be the number of executions in which 
A accepts the call produced in cell vq, q± the number of executions in which 
A accepts both calls in cells va, 1 and va,2, and q2 the number of executions in 
which the algorithm accepts the three calls in cells vc, 1, vc,2, and vc, 3- 

The expected number of executions in which the algorithm does not accept 
the call in cell Vq and the adversary produces calls in cells Va, 1 and va,2 is 
-f (t — qo). Hence, the expected number of executions in which the algorithm does 
not accept the calls in cells vo, va, 1, and va,2 and the adversary produces calls 
in cells vb, 1, Vb,2, and vb, 3 is § (|(f — qo) — qi) and the expected number of 
executions in which the algorithm does not accept the calls in cells i>o, va, 1, 
Va,2, Vb, 1, Vb, 2, and vb,3 and the adversary produces calls in cells vc, 1, vq, 2, 
v c ,3, and v c ,i is § (f {\{t - q 0 ) - qi) - q 2 ). Thus, 



£-p[Ba{o)] < 



qo + 2 gi + 3 q 2 + 4 | (| (|(t — qo) — qi) 
t 



92) 



= 1 



and > 25 / 12 . By Lemma 4 , we obtain that this is a lower bound on the 
competitiveness of any randomized algorithm against oblivious adversaries. □ 
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Abstract. Motivated by the problem of allocating optical bandwidth 
in tree-shaped WDM networks, we study the fractional path coloring 
problem in trees. We consider the class of locally-symmetric sets of paths 
on binary trees and prove that any such set of paths has a fractional 
coloring of cost at most 1.367L, where L denotes the load of the set of 
paths. Using this result, we obtain a randomized algorithm that colors 
any locally-symmetric set of paths of load L on a binary tree (with 
reasonable restrictions on its depth) using at most 1.367L + o(L) colors, 
with high probability. 



1 Introduction 

Let T(V,E ) be a bidirected tree, i.e., a tree with each edge consisting of two 
opposite directed edges. Let V be a set of directed paths on T. The path color- 
ing problem (or integral path coloring problem) is to assign colors to paths in 
V so that no two paths that share a directed edge of T are assigned the same 
color and the total number of colors used is minimized. The problem has ap- 
plications to WDM (Wavelength Division Multiplexing) routing in tree-shaped 
all-optical networks. In such networks, communication requests are considered 
as ordered transmitter-receiver pairs of network nodes. WDM technology es- 
tablishes communication by finding transmitter-receiver paths and assigning a 
wavelength to each path, so that no two paths going through the same fiber 

* This work was partially funded by the European Union under 1ST FET Project 
ALCOM-FT, 1ST FET Project CRESCCO and RTN Project ARACNE. 
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are assigned the same wavelength. Since state-of-the-art technology [10] allows 
for a limited number of wavelengths, the important engineering question to be 
solved is to establish communication so that the total number of wavelengths 
used is minimized. 

The path coloring problem in trees has been proved to be NP-hard in [5], thus 
the work on the topic mainly focuses on the design and analysis of approximation 
algorithms. Known results are expressed in terms of the load L of V , i.e., the 
maximum number of paths that share a directed edge of T. Note that, for any 
set of paths, its load is a lower bound on the optimal number of colors. An 
algorithm that assigns at most 2 L colors to any set of paths is implicit in the 
work of Raghavan and Upfal [9] . The best known upper bound for arbitrary trees 
is 5L/3 [6]. A randomized (1.613 + o(l))-approximation algorithm for trees of 
bounded degree is presented in [2] . 

The path coloring problem is still NP-hard when the input instances are 
restricted to sets of paths on binary trees [5] . A randomized path coloring algo- 
rithm for binary trees is presented in [1] . This algorithm colors any set of paths 
of load L on a binary tree (with some restrictions on its depth) using at most 
7L/5 + o(L) colors. In [8], it is proved that there are sets of paths on binary trees 
whose optimal path colorings require at least 5L/4 colors. 

The path coloring problem is still NP-hard when the input instances are 
restricted to symmetric sets of paths on binary trees [3]. A set of paths V is 
called symmetric if it can be partitioned into disjoint pairs of symmetric paths, 
i.e., (u,v) and (v,u). Symmetric sets of paths are important since many services 
which are supported by WDM networks (or are expected to be supported in the 
future) require bidirectional reservation of bandwidth. Algorithms that color 
any symmetric set of paths of load L on a tree (not necessarily binary) with 
3L/2 colors can be obtained by considering each pair of symmetric paths as an 
undirected one and then applying an algorithm for coloring undirected paths on 
undirected trees [5, 9] . For binary trees, the randomized algorithm presented in 
[1] gives the best known upper bound in this case. In [3] it is also proved that 
there are symmetric sets of paths on binary trees whose optimal path colorings 
require at least 5A/4 colors. 

The fractional path coloring problem is a natural relaxation of path coloring. 
Given a set of paths on a tree, a solution of the path coloring problem is also a 
solution for the fractional path coloring problem with cost equal to the number 
of colors in the solution of path coloring. Furthermore, the lower bound proofs 
in [3] and [8] still hold for fractional path coloring. Finding a fractional path 
coloring of optimal cost can be solved in polynomial time in trees of bounded 
degree [2]. Fractional path colorings are important since they can be used to 
obtain path colorings using a number of colors which is provably close to the 
cost of the fractional solution by applying randomized rounding techniques [2, 
7]. Moreover, it is interesting to prove upper bounds on the cost of optimal 
fractional path colorings in terms of the load, since they give insight to the path 
coloring problem. In [2], the result of [1] was extended to prove that any set of 
paths of load L on a binary tree has a fractional path coloring of cost 7L/5. 
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In this paper, we consider locally-symmetric sets of paths on binary trees. A 
set of paths V on a tree is called locally-symmetric if, for any two nodes u and 
v of the tree with distance at most 2, the number of paths coming from v and 
going to u equals the number of paths coming from u and going to v. Clearly, 
the class of locally-symmetric sets of paths on a tree T contain the class of 
symmetric sets of paths on T . We prove that any locally-symmetric set of paths 
of load L on a binary tree has a fractional path coloring of cost at most 1.367L. 
This fractional path coloring has some additional nice properties, and, using this 
result and techniques of [1], we obtain a randomized algorithm that colors any 
locally-symmetric set of paths of load L on a binary tree (with some restrictions 
on its depth) using 1.367L + o(L) colors, with high probability. Since the load 
of a set of paths is a lower bound on the minimum number of colors sufficient 
for coloring it, our algorithm is an (1.367 + o(l) ^approximation algorithm. 

The rest of the paper is structured as follows. In Section 2 we give formal 
definitions for the (fractional) path coloring problem. In Section 3 we discuss 
some properties of locally-symmetric sets of paths which are useful in the analysis 
of our algorithms. We describe the fractional path coloring algorithm in Section 
4 and present its analysis in Section 5. We discuss the extension of the result for 
fractional path coloring to path coloring in Section 6. 

2 Fractional Path Colorings 

The graph coloring problem can be considered as finding an integral covering 
of the vertices of a graph by independent sets of unit weight so that the total 
weight (or cost) is minimized. Given a graph G = (V,E), this means solving the 
following integer linear program: 

minimize ^2j e z x (I) 
subject to Yliei vei x (-0 — 1 v eV 

x(I) € {0, 1} lei 

where T denotes the set of the independent sets of G. 

This formulation has a natural relaxation into the following linear program: 

minimize Xaex^CO 
subject to Eiex-vei ^(-0 - 1 v £V 

x(i ) >o lex 

The corresponding combinatorial problem is called the fractional coloring 
problem. If x is a valid weight assignment over the independent sets of the graph 
G, we call it a fractional coloring of G. 

Given a set of paths V on a graph G, we define an independent set of paths as 
a set of pairwise edge-disjoint paths. Equivalently, we may think of the conflict 
graph of V which is the graph having a node for each path of V and edges between 
any two nodes corresponding to conflicting (i.e., not edge-disjoint) paths; an 
independent set of paths corresponds to an independent set on the conflict graph. 
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The fractional path coloring problem is defined as the fractional coloring problem 
on the conflict graph. 

In [2] it is proved that optimal fractional path colorings in bounded-degree 
trees can be computed by solving a linear program of polynomial (in terms of 
the load of the set of paths and the size of the tree) size. These techniques can 
be extended to graphs of bounded degree and bounded treewidth. A polynomial 
time algorithm for computing optimal fractional path colorings in rings is implicit 
in [7], 

Although computing an optimal fractional path coloring of a locally-symmetric 
set of paths V on a binary tree can be done in polynomial time, in this paper 
we are interested in exploring the relation of the cost of the optimal fractional 
path coloring with the load of V. 

3 Properties of Locally-Symmetric Sets of Paths 

In this section we give some useful properties of locally-symmetric sets of paths 
on binary trees. Let T be a binary tree. Without loss of generality, we assume 
that all nodes of T have degree either 1 or 3. We denote by r (root) a leaf node 
of T. Starting from r, we assign labels to the nodes of the tree by performing a 
breadth-first-search (r is assigned 0). For each non-leaf node v, we will denote 
by p(v) the parent of v, and by l(v) and r(v) the left and right child of v, 
respectively. 

Let V be a locally-symmetric set of paths. We may assume that V is normal , 
i.e., it satisfies the following properties: 

— It has full load L at each directed edge. 

— For every node u, the paths that originate or terminate at u, appear on only 
one of the three edges adjacent to u. 

If the initial set of paths is not normal, we can transform it to a normal 
one by first adding pairs of symmetric single-hop paths on the edges of the tree 
which are not fully-loaded, and then, for each non-leaf node v, by merging paths 
terminating at v with paths originating at v that do not traverse the same edge 
in opposite directions. It can be easily verified that this can be done in such a 
way that the resulting set of paths is locally-symmetric. 

Claim. Let V be a locally-symmetric set of paths on a binary tree T with load 
L. There exists a normal locally-symmetric set of paths V' such that if V' has 
a (fractional) coloring of cost c, then V has a (fractional) coloring of cost at 
most c. 

We partition the set of paths of V that go through v to the following disjoint 
subsets: the set M,| of the paths that come from p{v) and go to l( v), the set 
of the paths that come from l(v) and go to p(v), the set M'f of the paths that 
come from p(v ) and go to r( v), the set M* of the paths that come from r(v) 
and go to p(v), the set of the paths that come from l(v) and go to r(v), and 
the set M® of the paths that come from r(v) and go to l(v). Since V is normal, 
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Fig. 1. The two cases for paths touching a non-leaf node v. Numbers represent groups 
of paths (number 1 implies the set of paths M,[ , etc.). 



we only need to consider two cases for the set P v of paths of V touching node 
v. These cases are depicted in Figure 1. 

— Scenario I: The paths of P v originating from or terminating at v touch 
node p(v). We denote by Mj the set of paths that come from p(v) and stop 
at v and by M® the set of paths that originate from v and go to p(v). 

— Scenario II: The paths of P v originating from or terminating at v touch a 
child node of v (wlog r( v)). We denote by Mf the set of paths that originate 
from v and go to r{v) and by M® the set of paths that come from r(v) and 
stop at v. 

Claim. Let V be a normal locally-symmetric set of paths on a binary tree T with 
load L. Let v be a non-leaf node of T and P v be the subset of P that contains 
the paths touching v. 

— If P v belongs to Scenario I, then \Mf\ = |M^| = |M®| = \M*\ < L/ 2, 
|M„ 5 | = |M®| = L- \Mf\, and |Mj| = |M®| = L — 2 |M/|. 

— If P v belongs to Scenario II, then \M},\ = \M%\ > L/ 2, |M®| = \M*\ = 
\My \ = \My \ = L — \My\, and |Mj| = |M®| = 2\M$\ - L. 

4 The Algorithm 

In this section we present our fractional path coloring algorithm. It uses a pa- 
rameter D £ [2/3, 2+ / //2 ] . 

Let T be a binary tree and V a locally-symmetric set of paths on T . We 
denote by T the set of all independent sets of paths in V. Given an edge e, we 
denote by Xf the set of independent sets of X that contain exactly one path 
traversing e in some direction and by X ^ the set of independent sets of X that 
contain two paths traversing e (in opposite directions). 

Definition 1 (Property 1). A weight assignment x on the independent sets 
of X satisfies Property 1 at a node v if 
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— Property 1 is satisfied on each node u which has label smaller than the label 
ofv, and 

— either v is a leaf different from the root or for any path p traversing an edge 
e between v and a child of v in some direction, it is 

E x{T) = \-D. 

/£/ :p£l 

Definition 2 (Property 2). A weight assignment x on the independent sets 
of I satisfies Property 2 at a node v if 

— Property 2 is satisfied on each node u which has label smaller than the label 
ofv, and 

— either v is a leaf different from the root or for any two paths p, q traversing 
an edge e between v and a child of v in opposite directions, it is 

E *<'> = !■ 

/£/ -.p,qel 



Definition 3 (Property 3). A weight assignment x on the independent sets of 
X satisfies Property 3 at a node v if for any independent set I of I that contains 
at least one path touching neither v nor nodes of label smaller than v ’s, it is 
x (/) = 0. 



Definition 4 (Property 4). A weight assignment x on the independent sets 
of I satisfies Property 4 if 



E x(7 J - 

lex 



AD 2 - AD + 4 
3D 



L. 



Our algorithm assigns weights x to the independent sets of X such that x is 
a valid fractional coloring of small cost. 

First, the algorithm performs the following initialization step: 

— For each independent set I € X consisting only of one path that terminates 
at or originates from r, it sets x r (I) = 1 — D. 

— For each independent set I £ X consisting only of two opposite directed paths 
one terminating at and the other originating from r, it sets x r (I) = D / L. 

— For any other independent set J el, it sets x r (I) = 0. 

Then, for i = 1, . . . ,n — 1 (where n is the number of nodes of the tree) , the 
algorithm executes the procedure Fract-Color on node v with label i. 

The procedure Fract-Color at a node v takes as input the set P v of paths 
touching node v and a non-negative weight assignment x p < v ) which satisfies Prop- 
erties 1, 2 and 3 at node p(y) and Property 4. If v is a leaf, the procedure Fract- 
Color sets the weight assignment x v equal to x p t v \ and stops. Otherwise, it 
computes a new non-negative weight assignment x v which satisfies Properties 1, 
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2 and 3 at node v and Property 4. It also sets the weight assignments x u for all 
the nodes u with label smaller than v’s equal to x v . 

Finally, the algorithm outputs a weight assignment x = x u where u is the 
node with label n — 1. 

Fract-Color will be precisely described in the next section. For now, we use 
its input-output specification to show inductively that the algorithm produces 
a weight assignment x which satisfies Properties 1, 2 and 3 at each node of 
T and Property 4. Clearly, after the initialization step, the assignment x r is 
non-negative and satisfies Properties 1, 2 and 3 at node r. Also, under weight 
assignment x r , there are 2 L independent sets with weight 1 — D, L 2 independent 
sets with weight D / L while all other independent sets have zero weight. Thus, 
Property 4 is satisfied since 



53® r (J) = (2 -D)L < 
/ex 



AD 2 - AD + 4 

3 D 



L. 



Assuming that the execution of Fract-Color at a node u with label i has 
produced a non-negative weight assignment x u that satisfies Properties 1, 2 and 
3 at u and Property 4, we can easily show that the execution of Fract-Color 
at a node v with label i + 1 creates a non-negative weight assignment x v that 
satisfies Properties 1, 2 and 3 at node v and Property 4. By the description of 
Fract-Color, we just have to show that the weight assignment x u which is 
part of the input of the execution of Fract-Color at node v satisfies Properties 
1, 2 and 3 at node p(v) and Property 4. Property 4 is obviously satisfied. Since 
either p(v) = u or p(v) has a label smaller than ids, x u satisfies Properties 1, 2 
and 3 at p(v). 

Let p be a path traversing an edge e between some node v and a child of v in 
some direction. Note that p is contained only in independent sets of the disjoint 
sets if and iff . Furthermore, x satisfies Properties 1, 2 and 3 at node v and 
there are L paths that traverse the edge e in opposite direction to p. We obtain 
that 

x ( 7 ) = Yh + Y = L 

/eX: P e/ /e/ -pel iei -pel 

Also, x is non-negative. Thus, the algorithm specified above finds a fractional 
coloring x of the paths of V of cost at most 4D ^ a f ) D+4 L. In the following section, 
we describe the procedure Fract-Color and prove that it works correctly (i.e. , 
as specified above) provided that D £ [2/3, 2+ 4 v ^ ]. Substituting D = 2 4 2 , we 
obtain the following theorem. 



Theorem 1. For any locally-symmetric set of paths V of load L on a binary 
tree T, there exists a fractional coloring of cost at most 1.367 L. 



5 The Procedure Fract-Color 

Let T be a binary tree and V a normal locally-symmetric set of paths on T. 
We show how the procedure Fract-Color works when executed at a non-leaf 
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node v. Let P v be the set of paths of V touching node v. We have to consider 
two cases: one for Scenario I and one for Scenario II. Due to lack of space, we 
present only the case of Scenario I here. 

We denote by I v the set of independent sets of 1 that contain only paths 
touching nodes with label smaller than v. We denote by 7° the set of independent 
sets in I v that do not contain paths traversing the edge between v and its parent 
in some direction. We denote by 7® for i = 1,2, 3, 4, 7, 8, the set of independent 
sets of I v which contain a path p of M* and no path traversing the edge between 
v and its parent in opposite direction to p, and by I®- 7 the set of independent 
sets of 1 which contain a path of M® and a path of Mf, such that paths of M® 
traverse the edge between v and its parent in opposite direction than paths in 
Mi. 

When executed at v, Fract-Color takes as input the set P v and a non- 
negative weight assignment a : p ( v ){I) on the independent sets of 1 that satisfies 
Properties 1, 2, and 3 at p(v) and Property 4. Fract-Color computes a non- 
negative weight assignment x v (I) on the independent sets of 1 that satisfies 
Properties 1, 2, and 3 at v and Property 4. This is done in the following way: 



— For each independent set 7' of If and if and each path p of Mf, we set to 

x v (I) = the weight of the independent set consisting of the 

paths of I' and path p and x v {I') = (1 — a)x p ^(I'). 

— For each independent set I' of If and If and each path p of M®, we set to 
x v (I) = j^MT\Xp( v ){I') the weight of the independent set consisting of the 
paths of /' and path p and x v {I') = (1 — a)x p ^(I'). 

— For each independent set I' of If 4 and each path p of M®, we set to x v (I) = 
L-\M 1 \ x p(y )(^') the weight of the independent set consisting of the paths of 
/' and path p and x v (I') = (1 — pi)x p ^ v \{I'). 

— For each independent set I' of If 3 and each path p of M®, we set to x v (I) = 
L-\M 1 \ x p(y )(^') the weight of the independent set consisting of the paths of 
/' and path p and x v (I') = (1 — P)x p m(I'). 

— For each independent set I' of If 8 and if 7 and each path p of Mf, we set to 
x v (I) = L _|\ f i| ^p(-»)(7') the weight of the independent set consisting of the 
paths of I' and path p and x v (I') = (1 — 7 )x p ( v )(I'). 

— For each independent set I' of If 7 and if 8 and each path p of M®, we set to 
x v (I) = jpzfMT-\ x p(v)(I') the weight of the independent set consisting of the 
paths of I' and path p and x v (I') = (1 — 7 )x p ( v )(I'). 

— Let 

ki = 'y ] x p ^(i) 

/e/°u/ 7 u/ 8 u/ 78 



and 



k'2 



2n(L — \Mf\) + 



D(L-\Ml\f 

L 



— For each independent set /' of if U If U if U if 8 and each path p of Mf and 
M®, we set to x v (I) = max {fc t k 2 } x p ( v )(-0 the weight of the independent set 
consisting of the paths of /' and path p. 
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— For each independent set 7' of 7° U if U 7® U if 8 and each pair of paths p , q of 

and M® respectively, we set to x v (I) = max{ ^ fc2 } L z P (>)(7') the weight 
of the independent set consisting of the paths of 7' and paths p and q. 

— For each independent set 7 of 7® U if U 7® U if 8 we set x v (I) = max{0, 1 — 

kl\ x p(v){I)- 

— For each path p of M® and A7®, we set to x v (I) = nmax{0,l — |^} the 
weight of the independent set consisting only of p. 

— For each pair of paths p, q of M® and A7® respectively, we set to x v (I) = 
j; max{0, 1 — j^} the weight of the independent set consisting only of p and 
Q- 

— For all other independent sets 7 (i.e., independent sets that contain at least 
one path touching only nodes with label greater than As) we set x v (I) = 0. 



Now, we have to set values to the parameters a, (3, 7, n so that Fract-Color 
is correct (i.e., it matches the input-output specification given in the previous 
section). The next lemma gives sufficient conditions for this purpose. 

Lemma 1 (Correctness conditions). If 



0 < a, /3, 7 < 1 
n > 0 

aL( 1 - 77) + (3\Mf \D + 7(7, - 2\Mf\)D = D(L - \ M$\) 
-ml\ 2 D + nL(L - \Ml\) = {L- - D)L - £>^1) 

D 2 - 477 



2n(L — \Mf\) < 



377 



-L - (4 - 277) | | + 3D ^ 2 

Ju 



(1) 

(2) 

( 3 ) 

( 4 ) 

( 5 ) 



then Fract-Color is correct. 



Proof. It can be easily verified that conditions (1) and (2) and the fact that the 
weight assignment x p ( v ) is non-negative guarantee that x v is non-negative. Also, 
by the definition of Fract-Color, x v satisfies Property 3. 

By making calculations, it can be verified that 

^27(7) = ^x p(v )(I) + max{0 ,k 2 - fa}. 
iex iex 

If k\ > k- 2 , x v satisfies Property 4 since x p r v ) satisfies Property 4. If k% < k- 2 , 
condition (5) and the fact that x p ^ v ) satisfies Property 4 guarantee that x v satisfy 
Property 4 as well. 

Now, in order to prove Properties 1 and 2 at node v, we have to prove that 
for any edge e between a node u with label at most the label of v and a child of 
u, for any path p traversing e, it is Yliei p ^i x v(I) = 1 — 77 and for any set of 
paths p. q traversing e in opposite directions, it is p qei x v(I) ~ D/L. We 

have to distinguish between two cases for e: (i) e is an edge between a node u 
with label smaller than v’s and a child of u and (ii) e is an edge between v and 
a child of v. 
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First, we give the proof for case (i). Given an independent set I of I v , we 
denote by J V (I) the set of independent sets that contain all paths of I , (possibly) 
paths of M/ and Mf , , and no other paths. Since x v satisfies Property 3 at node 
v, it is x v (I) = 0 for any independent set / not belonging to (J jg/ J V (I). By 
carefully reading the procedure Fract-Color, we can see that the weight of I 
under x p ( vj equals the sum of weights of the independent sets in </„(/) under x v . 
Also, since a? p („) satisfies Property 3 at p(v), it is x p / v ) = 0 for all independent 
sets not belonging to I v . Furthermore, x p ( v ) satisfies Properties 1 and 2 at p(v). 
We obtain that 

E x v( f ) = E E Xv n) 

iei -.pel iei n/ -.pel I'EJ (/) 

= y i x p( v) (-0 

iei n/ -.pel 

= y ' Xp^(i) — i — d 

iei -.pel 

and 

^2 x v( i )= E E ^( J/ ) 

IEI :p,qEl IEI n/ :p,qEl I’EJ (I) 

y ] x p(v){i) 

IEI n I :p,qEl 

= y ^ Xp^y^I) = D/L. 

IEI '.p.qEl 

Proving case (ii) is lengthy. We have to prove it for e G {(u, l(v)), ( v , r(u))}, 
and, in each case, we have to dinstinguish between subcases for paths p and q. 
We will just show that for edge e = (v,l(v)) for a path p G and a path 
q G Ml, it is YI IgI :p , geJ *«(■?) = D/L. 



E x «( J ) 

IEI :p.qEl 



E 



/ e i 1 ■■ q e i 
/' e J (/) : p e /' 



E 

I e I 18 :qe I 
/' e j (/) -.pel’ 

a 



x v (I r ) + 
x v (I') 



E 



An 



I E Z 14 : q 6 / 

I' E J (I) : p £ I’ 



L-\Ml\ E *P(»)( J )+ L _| M 1| E ^G)( J ) 



iE^'.qEl 



lEl 14 :qEl 



L-\Ml\ E a ’pG)( J ) 

1 vl lEl 18 :qEl 

aL(l - D) + + 7 i9(L - 2|A^|) 

D/L 
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The last equality follows by condition (3). Conditions (3) and (4) are enough 
to prove all possible subcases. □ 

In order to prove that Fract-Color is correct, we will compute values for 
a,j3,j,n such that the correctness conditions (l)-(5) of Lemma 1 are satisfied. 
We distinguish between two cases according to the size of M\. 

CASE I. Ml | > ^=p-L. j n thjg case, we have the following settings 

a = 1 

(L-\Ml\)(D\Ml\-(l-D)L) 

7 \M}\D 

n = 0 

Clearly, a satisfies condition (1) and n satisfies condition (2). By simple 
calculations, we obtain that (3), (4), and (5) are also satisfied. Since |M/| > 
it is /? > 0. Also, 

(L - \Ml\)(D\Ml\ - (1 - D)L) - D\Ml\ 2 

^ D\Ml | 2 

_ -2D\Ml\ 2 + L\Ml\ - (1 - D)L 2 
D\Ml \ 2 

The expression in the last line is always non-positive provided that D £ 2/3, 

2+ 4 v ^ . Thus, /3 satisfies condition (1). Also, since < |M/| < L/ 2, 7 

satisfies ( 1 ). 




Clearly, (3 and 7 satisfy condition (1). By making simple calculations, we ob- 
tain that (3), (4), and (5) are satisfied. Since |M/| < n satisfies condition 

( 2 ). Also, since 0 < |M/| < ^-jpL, a satisfies condition ( 1 ). 



6 Extensions to the Path Coloring Problem 

In this section we outline our path coloring algorithm. Similar ideas are used in 
[ 1 ] to prove a weaker result for general (i.e., non-locally-symmetric) sets of paths 
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on binary trees. Let T be a binary tree and V a normal locally-symmetric set of 
paths on T. Our algorithm uses a rational parameter D € [2/3, 2+ /‘ 2 ) ■ We start 
with a few definitions. 

Definition 5. Let v be a node of T different from the root and V a normal 
locally-symmetric set of paths. A probability distribution Q over all proper color- 
ings of paths ofV traversing the edge e between v and its parent with (2 — D)L 
colors is weakly uniform if for any two paths p,q € V that traverse e in opposite 
directions, the probability that p and q have the same color is D / L. 

Let v be a non-leaf node of T and let C be a coloring of paths of V that 
traverse the edge e between v and its parent. Let y be a color used by C. x is 
called single color if it is used in only one path traversing e and double color if 
it is used in two paths traversing e in opposite directions. We denote by A l v the 
set of single colors assigned to paths in M*, and by A l J the set of double colors 
assigned to paths in M* and Mf. 

Definition 6. Let v be a non-leaf node of T and V a normal locally-symmetric 
set of paths. A weakly uniform probability distribution Q over all proper colorings 
of paths ofV traversing the edge e between v and its parent with (2 — D)L colors 
is strongly uniform if 

Kl = L 

for any pair i,j such that paths of M* traverse e in opposite direction to paths 

of Ml. 

First, the algorithm performs the following initialization step to produce a 
random coloring of the paths touching r with (2 — D)L colors according to the 
weakly uniform probability distribution, ft colors the paths originating from the 
root r. ft selects uniformly at random without replacement DL colors of the L 
colors used to color the paths originating from r and assigns them to DL paths 
which are selected uniformly at random without replacement from the L paths 
terminating at r. ft also assigns (1 — D)L new colors to the paths terminating 
at r which have not been colored. 

Then, for i = 1, . . . , n — 1, the algorithm executes the procedures Recolor 
and Color at node v with label i. 

The procedure Recolor at a node v takes as input a random coloring C v 
of the paths traversing the edge between v and its parent with (2 — D)L colors 
according to the weakly uniform probability distribution. Recolor stops if v is 
a leaf. Otherwise, it produces a new random coloring C' v of the paths traversing 
the edge between v and its parent with (2 — D)L colors according to the strongly 
uniform probability distribution. 

The procedure Color at a node v takes as input a random coloring C' v of 
the paths traversing the edge between v and its parent with (2 — D)L colors 
according to the strongly uniform probability distribution. Color stops if v is 
a leaf. Otherwise, it produces a random coloring C" of the paths touching node 
v in such a way that: 
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— The restriction C;(„) of C" on the paths traversing the edge between v and l(v) 
is a random coloring with (2 — D)L colors according to the weakly uniform 
probability distribution. 

— The restriction C r („) of C" on the paths traversing the edge between v and 
r(v) is a random coloring with (2 — D)L colors according to the weakly 
uniform probability distribution. 

— The coloring C" uses at most 4D colors. 

Finally, the algorithm uses a deterministic algorithm to color the set of paths 
whose color has been changed by Recolor. 

In the following we briefly discuss the main ideas used for the precise def- 
inition of procedures Recolor and Color. We remark that the procedures 
Recolor and Color can be implemented to run in time polynomial in L, 
thus, our algorithm runs in time polynomial in L and the number of nodes of 
the tree. 

Let v be a non-leaf node of T . Observe that if C v is a random coloring of the 
paths traversing the edge between v and its parent, then the numbers \A l f\ are 
random variables with expectation 



em\ = |Mill ^ |£) . 

jL/ 

In [1], it is shown that | A l f | ! s follow a hypergeometrical-like distribution and are 
sharply concentrated around their expectations. The procedure Recolor uses 
this property and, with high probability, by recoloring a small number of paths 
traversing the edge between v and its parent, it produces a random coloring C' v 
which follows the strongly uniform probability distribution. This is done in such 
a way that after all executions of Recolor the load of the paths of V whose 
color has been changed at least once is o(L) with high probability. 

The procedure Color mimics Fract-Color in some sense. During its ex- 
ecution at a non-leaf node v such that P v belongs to Scenario I (the case of 
Scenario II is similar to Scenario II of Fract-Color) it colors the paths of M ® 
and M ® using colors of A l v and A l J in such a way that the probability that two 
paths traversing one of the edges between v and a child of v in opposite directions 
are assigned the same color equals D/L (recall that this equals the sum of the 
weights x v produced by Fract-Color of the independent sets containing both 
paths). The analysis is much similar to the analysis of Fract-Color. By work- 
ing on the details of the analysis, we conclude to similar correctness conditions 
with those of Fract-Color. 

We now state our result for the path coloring of locally-symmetric sets of 
paths. 

Theorem 2. There exists a randomized polynomial-time algorithm which, for 
any constant S < 1/3, colors any locally-symmetric set of paths of load L on 
a binary tree of depth at most L s / 8, using at most 1.367 L + o(L) colors, with 
probability at least 1 — exp (— 12(7/)). 
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The restriction in the depth of the tree is necessary so that the upper bound 
on the number of colors is guaranteed with high probability. The number of 
nodes of the tree may be exponential in L (i.e., up to 2 L / 8 ). 
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Abstract. We consider the problem of finding a schedule for n identical 
malleable tasks on p identical processors with minimal completion time. 
This problem arises while using the branch & bound or the divide & 
conquer strategy to solve a problem on a parallel system. If nothing is 
known about the sub-problems, then they are assumed to be identical. 
We assume that the execution time decreases with the number of proces- 
sors while the computational work increases. We give an algorithm with 
running time exponential in p which computes an optimal schedule. In 
order to approximate an optimal schedule, we use the concept of phase- 
by-phase schedules. Here schedules consist of phases in which every job 
uses the same number of processors. We prove that one can approximate 
an optimal schedule up to a factor J using constant time, and we show 
that this is optimal. Furthermore, we give an e-approximation algorithm 
if the speed-up is optimal up to a constant factor. 



1 Introduction 

Motivation-Framework. The multiprocessor scheduling problem for identical 
processors is well-known and has been studied extensively in many different vari- 
ations. If the number of processors on which a specific job has to be executed 
is part of the input, then the tasks are called non-malleable. Otherwise they 
are called malleable. Likewise, if the jobs may be interrupted while being exe- 
cuted, then the resulting schedule is called preemptive, otherwise it is called non- 
preemptive. Furthermore, the specification depends on precedence constraints 
between jobs, and on the objective. Veltman et al. [22] give an overview of a 
multitude of works on various specifications. 

In this paper we consider the problem of finding a non-preemptive schedule 
for n identical jobs on p identical processors with minimal total completion time, 
the so-called makespan. Since the jobs are identical, the time function is equal 

* Partly supported by the DFG-Sonderforschungsbereich 376 Massive Parallelitat: Al- 
gorithmen, Entwurfsmethoden, Anwendungen, and by the 1ST Programme of the 
EU under contract number IST-1999- 14186 (ALCOM-FT). 



R. Solis-Oba and K. Jansen (Eds.): WAOA 2003, LNCS 2909, pp. 95—108, 2004. 
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for all jobs, that is, the execution time on any certain number of processors is 
the same for all jobs. 

Using the branch & bound or the divide & conquer strategy to solve a prob- 
lem, the problem is split into smaller subproblems which have to be solved, that 
is, tasks which have to be executed. In many cases the parallelism given by the 
branch & bound tree or by the divide & conquer tree is sufficient to yield a good 
speed-up. But in many other cases this is not true and we have to parallelize the 
computations done at the tree nodes. All these computations are of the same 
kind and so we may assume that all these tasks are identical. In this case the 
scheduling problem we consider applies. Our motivation for doing research on 
this problem is that this scheduling problem arises isolating real roots in parallel 
using the Descartes method [8]. As far as we know there is no publication on 
this scheduling problem. We assume that the same properties for the execution 
time as in [3] hold. This implies that the execution of the jobs achieves some 
speed-up, but no super-linear speed-up. Up to now, it is not clear whether the 
corresponding decision problem is MV- hard or not. 

Related work. There exist many results on the complexity of various schedul- 
ing problems [5]. Du and Leung [10] showed that both the non-malleable and 
the malleable scheduling problem are MV- hard, so researchers are interested in 
approximation algorithms. If the tasks are non-malleable, then the scheduling 
problem is a special case of the resource constraint scheduling problem. Hence 
the optimal schedule can be approximated up to factor 2 using list planning [12]. 

Krishnamurti and Ma [17] were the first who did research on the schedul- 
ing problem with malleable tasks. Belklrale and Banerjee [3] introduced an al- 
gorithm with approximation factor 2/(1 — 1/p), assuming that execution time 
decreases with the number of processors while the computational work increases. 
Turek et al. [21] improved this result, using no assumptions, and showed an ap- 
proximation factor 2. The latest result is from Mounie et al. [18]. Using the same 
assumptions for the execution time as Belklrale and Banerjee [3], they proved 
that an optimal schedule for malleable tasks can be approximated up to the fac- 
tor %/3- Blazewicz et al. [4] improved this result to 2, and a proof for the factor 
| has been submitted for publication [19]. 

The best known approximation algorithm for schedules where jobs are only 
allowed to be executed on processors with successive indices is from Stein- 
berg [20]. He showed an approximation factor 2. Note that this scheduling prob- 
lem is closely related to the orthogonal packing problem of rectangles first inves- 
tigated by Baker et al. [1]. See [11,2, 16] for a typology of cutting and packing 
problems and results on approximation algorithms. 

Jansen and Porkolab [14] invented the first polynomial approximation algo- 
rithm for a constant number of processors using linear programming. Note that 
this problem is related to orthogonal strip packing of rectangles [6, 15]. Further- 
more, Jansen [13] proposed an asymptotic fully polynomial time approximation 
scheme if p is part of the input. 
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Results. In Section 2 we formally define the scheduling problem and introduce 
an algorithm which computes a schedule with minimal makespan. We show that 
its running time is exponential in the number p of processors. If p is constant, 
then this yields an algorithm polynomial in the number n of jobs. However, 
the algorithm is not suitable for practical purposes. In order to approximate 
an optimal schedule, we introduce phase- by-plrase schedules in Section 3. Here 
schedules have a simple structure. They consist of phases in which each job 
uses the same number of processors. A new phase can not be started until the 
last phase has finished. We illustrate with help of an example that the quotient 
of the makespan of an optimal plrase-by-phase schedule and the makespan of 
an optimal schedule can be |. Furthermore, we give a constant time algorithm 
which only uses certain phase-by-plrase schedules providing an approximation 
factor |. In order to prove this approximation factor, we prove two technical 
lemmas in Section 4, providing good lower bounds on the makespan of an optimal 
schedule as well as restricting the set of schedules which have to be considered to 
dominating schedules. In Sections 5 we use these lemmas to prove the correctness 
of the algorithm for the case p < n < [|pj by case analysis. Due to lack of space 
the proofs of the other cases are omitted here. They can be done in a similar 
way and can be found in [9]. Finally, we give an e-approximation algorithm in 
the case that the speed-up is optimal up to a constant factor in Section 6. 

2 Optimal Schedules 

For the malleable scheduling problem the input consists of n, p and t : n is the 
number of identical jobs which can be executed on different number of processors 
in parallel, p is the number of identical processors on which the jobs are to be 
scheduled, t is the time function given by t : {1, . . . ,p} — R + . Here t(i) is the 
running time needed to compute a job on i processors. For 1 < i < j < p the 
time function t must have the following properties: 

— monotony: t(j) < t(i) 

— speed-up property: i ■ t(i ) < j ' t{j ) 

These properties imply that the execution of the jobs may achieve some speed- 
up, but no super-linear speed-up. 

A schedule S assigns a set a, of processors and a starting time t* to every 
job i such that all jobs are executed and every processor executes at most one 
job at any time. We call (cr,, t,) the plan according to job i. The objective is to 
minimize the makespan defined by 

max{r + f(|oj) | (cr, r) G S}. 

A schedule with minimal makespan is called optimal schedule. 

Our next goal is to find an algorithm to compute an optimal schedule. Some of 
these results are also in [7]. In order to proof the correctness of our algorithm, we 
first show that there always exists an optimal schedule with certain properties. 
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Definition 1. 

1. The load vector l(S) = (h, . . . ,l p ) of a schedule S is defined by 

li = max{r + f(|oj) | (cr, r) £ S A i £ a}, 1 < i < p. 

2. The sorted load vector A(S') = (Ai, . . . , \ p ) is given by A,; = l n (i), 1 <i< 
p, where ir is a permutation with 1^^ < ln(j) for all 1 < i < j < p. Note 
that A p is the makespan of S. 

We will show that for each schedule there exists another schedule with the 
same makespan and certain properties. One of this properties is that each job 
in a schedule starts either at time r = 0 or directly subsequent to another job. 
We now define this property formally. 

Definition 2. A schedule S is packed if for all plans (cr, r) £ S either r = 0 
holds, or there exists another pair ( <j',t ') £ S with crficr' ^ 0 and r , +f(|cr , |) = r. 

Definition 3. Let S be a schedule for n jobs. Then S contains a schedule S' if 
S' C S. The schedules Si , . . . , S n -i are called intermediary schedules of S 
if Si C ... C S n -i C S and |Si| = i for all 1 < i < n. 

In general we can compute a schedule for i > 1 jobs by using a schedule for 
i— 1 jobs and assigning a set of free processors to the zth job. We use this fact 
to give an algorithm. Every step leads to a set of intermediary schedules. 

Lemma 1. For each optimal schedule S for n jobs there exists a schedule S n 
and intermediary schedules Si , ... , S n -i of S n such that for 1 < i < n we have 

1. S and S n have the same makespan. 

2. Si is packed, and for the sorted load vector (A^\ . . . , A p •*) of Si the inequality 
A p — A^ < t(l) holds. No job in Si is finished later than in S. 

Proof. Let 1 < i < n, and let Si_i be a packed intermediary schedule. Let j be 
a processor with l *. -1 = A 1 , i.e. j has minimal load among all processors in 
Si-i. Choose a plan (a, r) £ S with j £ a and minimal starting time which has 
not been considered yet. Denote S' = Sj_i U {(cr, r)}. 

If S' is packed then we set Si := S', and we are done. Otherwise let r* 
be the earliest time at which all processors in a in Si-i are ready. Then S* := 
-S' i _iU{(cr, r*)} is packed, and S*L)(S\S’) is an optimal schedule. Hence Si := S* 
is an intermediary schedule. 

If the makespan of S' is larger than A j^ 1 + t(l), then remove (cr, r) from the 
schedule and add the plan ({j}, l)~ )■ Proceed in the same way if there exists no 
(cr, r) £ S \ Si-i with j £ cr and the makespan of S is larger than Aj _1 + f(l). If 
no such (cr, r) exists and the makespan of S is at most A{ _1 + f(l) then we may 
proceed with any plan from S \ Si - 1 with minimal starting time. □ 

We get optimal schedules by computing intermediary schedules of optimal 
schedules iteratively. Due to Lemma 1 we can restrict our search to packed 
schedules with sorted load vector (Ai, . . . , X p ) with 0 < \ p — Ai < f(l). 
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In the following we will present optimal algorithms. We do so for time func- 
tions t : N — > R + and t : N — > N. In both cases we assume that manipulating 
numbers can be done in constant time. Applying the Turing machine model 
would result in an additional factor 0 (log (makespan)) = 0(log(n) + log(f(l))) if 
N is the range of the time function. Note that the jobs are identical and therefore 
the number n of jobs contributes only log(n) bits to the input size. 

Lemma 2. If a schedule S is packed, then each entry of the load vector A (S) = 
(Ai, . . . , A p ) can be written as a linear combination 

A i £ (aif(l) H b a p t{p ) | «i, e {()..... n}}. 



Theorem 1. 

1. If the range of the time function t is K + , then there exists an algorithm which 
computes an optimal schedule using 0(n(n + l) p 2 P ) time. 

2. If the range of the time function t is N, then there exist algorithms which 
compute an optimal schedule using 0(log(n) • |P| 3 ) or 0(n ■ |Vj • 2 P ) time, 
where \V\ = t(l) p . 

Proof. We can view the scheduling problem as a constrained shortest path prob- 
lem in a directed graph. For a sorted load vector A (S) = (Ai , . . . , X p ) of a schedule 
S we denote with yl(S) = (Aj — Ai)j = i v .. )P the pattern of S. In the directed graph 
every possible pattern corresponds to a node. Let V be the set of nodes. A di- 
rected edge from node i to node j exists if and only if we can get a schedule with 
the pattern corresponding to node j by adding a plan to a schedule with the 
pattern corresponding to node i. We assign to each edge (i, j) the augmentation 
of the makespan gained by adding the corresponding plan. Now we can compute 
the minimal makespan by searching a path with n edges having minimal weight. 

1. If the range of the time function t is R + , then Lemma 2 yields \V\ < (n+ l) p2 . 
Furthermore we know that the graph has an out-degree at most 2 P . We use 
dynamic programming to compute all paths with i edges for 1 < i < n. So 
we need at most 0(n(n+ l) p 2 P ) time. 

2. If the range of the time function t is N, then we have |Vj = t( l) p . We can 

compute a minimal weight path with n edges in 0(log(n) • |I^| 3 ) time using 
the doubling-technique. The time complexity of the second claim can be 
proved in the same way as in part (1) using the fact that we only have to 
consider |Vj = f(l) p nodes for 1 < * < n. □ 

We have proved that it is possible to compute an optimal schedule using time 
polynomial in n if the number of processors is constant. However, the running 
time depends exponentially on the number of processors. 



3 Phase-by-Phase Schedules (PPS) 

We will use a phase-by-phase schedule (PPS) to approximate an optimal 
schedule. Here a schedule consists of phases in which each job uses the same 
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number of processors. A new phase can not be started until the last phase has 
finished. Hence we do not have to store plans for all jobs but may write a phase- 
by-phase schedule with k phases as PPS = where ij denotes the 

number of processors used in phase j. The makespan is 

k 

makespan(PPS') = E # ^)- 

3 = 1 

Decker and Krandick [8] introduced an algorithm which computes an optimal 
phase-by-plrase schedule using 0(n 2 ) time. The running time of this algorithm 
can be improved to C3(?r-min{n,p}). We now show that there exists an algorithm 
which computes an optimal plrase-by-phase schedule using 0{p 3 + log(n)) time. 

Theorem 2. There exists an algorithm which computes an optimal phase-by- 
phase schedule using 0(p 3 + log(n)) time. 

Proof. Let Xi be the number of phases using i processors. Due to the speed-up 
property Xi < i — 1 holds for i > 2. Furthermore at most |_f J jobs are executed 
during such a phase. Therefore at most 



E 

2<i<p 



P 
- i 



- < E (* ~ 1 ) - <{p~ i )p 

i- t— ' Lz J 



2<i<p 



jobs are executed in phases using more than one processor. 

The algorithm works as follows. If n > (p — 1 )p, then compute the minimal r 
such that n — r • p < (p— 1 )p. The schedule starts with r sequential phases. The 
improved algorithm from [8] is used to compute the schedule for the remaining 
n — r ■ p jobs. □ 

Decker [7] showed that the makespan of an optimal plrase-by-plrase schedule 
is at most twice as large as the makespan of an optimal schedule. The following 
example illustrates that computing an optimal plrase-by-plrase schedule can not 
lead to an approximation factor lower than |. 

Example 1. Let p = 5 and n = 3. Furthermore let f(l) = 1, t{ 2) = | and 
t(i) = ^ for all 3 < i < 5. The optimal plrase-by-plrase schedule is (2, 5) with 
makespan t(2) + f(5) = |. However, er = {({1, 2, 3}, 0), ({1, 2, 3}, |), ({4, 5}, 0)} 
is an optimal schedule with makespan 2f(3) = |. So the approximation factor 
via plrase-by-plrase schedules is |. Note that this example can be extended to 
p = 3j + 2 and n = 2 j + 1 with j £ N. 

Our next goal is to find an algorithm with approximation factor | using 
constant time. The idea of our algorithm is as follows: we compute a solution 
only depending on the relation between p and n. We do not know anything about 
the time function (up to the monotony and the speed-up property), and we will 
consider a plrase-by-plrase schedule. In the following we show that this algorithm 
provides an approximation factor of | . For the sake of readability we only return 
the makespan of the plrase-by-plrase schedule which we choose. Note that the 
plrase-by-plrase schedule is returned, too. The algorithm is as follows: 
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Algorithm 1 Phase-by-Phase Schedule (PPS) 

Input: number of jobs n, number of processors p, time function t 
Output: phase-by-phase schedule 




(1) 


begin 




(2) 


if n > p then 




(3) 


if p < n < LfpJ then 




(4) 


«= — 
|_n-pj 




(5) 


return t(l) + t(a) 




(6) 


if LfpJ < n < 2p then 




(V 


a ~ [n-lipll 




(8) 


return min{t(l) + t( 2) + t(a), 2£(1)} 




(9) 


if '2p < n < [fpj then 




(10) 


“-trfsj 




(11) 


return 2£(1) + t(a) 




(12) 


if [fpj < n < 3p then 




(13) 


“=U%iJ 




(W 


return min{2f(l) + t( 2) + f(a), 3£ (1) } 




(15) 


else 




(16) 


a = 1 ' 

[n-L — J-pJ 




(17) 


return [^J -t(l)+t(a) 




(18) 


else 




(19) 


if Lf J < n < p then 




(20) 


“-L-U 




(21) 


if 2[|J < n < p then 




(22) 


return min{t(l), f(2) + f(a)} 




(23) 


if LfJ + LfJ <«<2LfJ then 




(24) 


return min{f(l), t(2) + t(a), 2t(3)} 




(25) 


else 




(26) 


return min{t(l), t(2) + t(a),t( 3) + t(4)} 




(27) 


if Lf J <n< Lf J then 




(28) 


“=Ua,J 




(29) 


if < n < LfJ then 




(30) 


return min{t(2), f(3) + t(a)} 




(31) 


if 2Lf J < n < |p then 




(32) 


if p < 17 then 




(33) 


return min{t(2), t( 3) + t(a), t( 4) + t( 6)} 




(34) 


else 




(35) 


return min{f(2), t(3) + t(a), t(4) + t(5)} 




(36) 


else 




(37) 


return min{f(2), £ (3) + t(a), 2£(5) } 




(38) 


if LfJ < n < LfJ then 




(39) 


“=U,,J 




(40) 


if < n < LfJ then 




(41) 


return min{f(3), £(4) + f(a)} 




(42) 


if 2 L y J < n < |p then 
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Us) 

(44) 

(45) 

(46) 

(47) 

(48) 

(49) 

(50) 

(51) 

(52) 

(53) 

(54) 



end 



if rem(p, 7) = 5 and p < 36 then 

return min{£(3), t(4) + t(a),t( 5) + t(9), t(6) + t( 7)} 
if rem(p, 7) = 6 and p < 37 then 

return min{t(3), t(4) + t(o), t(5) + t(ll), t(6) + t(7)} 

else 

return min{£(3),£(4) +t(a),t( 6) + £(7)} 

else 

return min{£(3), £(4) + t(o), 2£(7)} 
if LfJ < n < LfcZuJ an< 3 > 5 then 

return min {t(k — 1 ),t(k) + £(a)} 



We now show that Algorithm 1 approximates an optimal schedule up to a 
factor | using constant time. Note that applying the Turing machine model 
would result in a time bound 0(log(n) +log(p) + log(£(l))). The proof for n < p 
is more complex, mostly since we aimed to get the real approximation factor for 
this interval. Thus the algorithm for n < p is rather complex. The approximation 
factor for the simple algorithm we use for n > p is |. However, we think that 
by using a more complex algorithm, we could get a better result. The example 
shown in Figure 1 demonstrates that for n > p the approximation factor is at 
least |. 
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Fig. 1 . Optimal schedule and optimal phase-by-phase schedule for p = 11, n = 12, 
and t(l) = 1, f(2) = | and t(i) = | for all 3 < i < 11. The makespan of the optimal 
schedules is t(2) +2t(3) = |, and the makespan of the optimal phase-by-phase schedule 
is t(l) +t(3) = |. So the approximation factor is |. 



Theorem 3. Algorithm 1 computes a phase-by-phase schedule which is an op- 
timal schedule up to a factor | using constant time. If n > p then the approxi- 
mation factor is | . 

Proof. For p < n < [fpj the claim follows with Lemma 5. Due to lack of space 
the proofs for the other cases are omitted here. They can be done in a similar 
way and can be found in [9]. 
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4 Technical Lemmas 

In the following we denote with Topt the makespan of an optimal schedule. In 
order to prove Theorem 3 we mainly consider the quotient of the upper bound 
provided by Algorithm 1 and a lower bound for Topt- The following lemma will 
help us to find good lower bounds. 

Lemma 3. Let 1 < Mi < u 2 < P, ^ < n, and let the makespan of an 

optimal schedule consist of at least two addends. 

1. If t(ui) is an addend in the makespan of an optimal schedule, then Tqpt > 
t(ui) + t(u 2 ). 

2. If u 2 < Mi + 1 and no t(i), i < u\, is an addend in the makespan of an 
optimal schedule, then Topt > t(ui) + t.(u 2 ). 

Proof. 

1. Let t(u±) + be the makespan of an optimal schedule. If there 

exists k t < u 2 , then we are done. Otherwise the speed-up property yields 

t(ui) + ^2 t(ki) > t(ui) + ^2 ’ t( u 2 ) = t( u 1 ) + — ’ t( u 2 ) ^2 T 

1 <i<l 1 <i<l P 1 <i<l 

> t(u 1 ) -I — - • t(u 2 ) ■ ( n — — ) > t(ui) + t(u 2 ). 
p \ u l) 

2. Let £(&») be the makespan of an optimal schedule. If t(ui) is an 

addend in this makespan, then we are done. Otherwise we have fc,; > u 2 for 
all 1 < i < l. We get 

t(ki) > ^2 t 2 - ' = — ■ t(u 2 ) ■ f > —t{u 2 ) ■ n 

1 <i<i Ki p 1 < t <i p 

> — ■ t(u 2 ) ( — + — ^ = tfu 2 ) -\ — - • t(u 2 ) > t(ui) + t{u 2 ). 

p \Ul U 2 J Ml 

This completes the proof of the lemma. □ 

We have to consider all makespans of possible schedules. Unfortunately the 
number of such makespans can become very large. So finding a good lower bound 
is difficult. The next definition helps us to restrict the number of makespans we 
have to consider. 

Definition 4. Let S,T be two packed schedules, and let the sums Xa<z<fci 
and Ei<K fe2 Tnakespans of S and T, respectively. Then S dominates 

T if for all valid time functions t, that is, time functions for which the monotony 
and the speed-up property hold, 

l<z<fci 1 <l<k 2 
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Lemma 4. Let S be a schedule, r £ Q + , n > p-r, and let I C {to £ N | in < p} 
be the set of number of processors which are allowed to be used. Then 



To pt > min 



Xit (*) 

iei 




Proof. Let 1 < j < p, and let yi{ j) be the number of jobs in an optimal schedule 
which use i processors including processor j. Now consider a solution to the 
problem. If there exists a j with y r ttAJl > then we are done. Otherwise we 
have 

\ ' \ ' Vi(j) \ ' \ ' Vi(j) , , 

zL 2^—— = zL zL v < p ■ r < n, 

l<j<p i(2l iG/ 

which contradicts the solvability. □ 

Due to Lemma 4 we only have to consider the makespans of dominating schedules 
for which Yliei T > r - 

In order to find another way to get a lower bound on Tqpti we use the speed-up 
property as follows: if each job in the schedule uses at least j processors, then 
each job needs at least j ■ t(j) area. This leads to the lower bound 

T 0 pt > ~ ■ (j ■ t(j)). 

P 

In the following this technique will be helpful. 



5 Proof for the Case p < n < [fpj 



Lemma 5. Let p < n < [fpj • Then Algorithm 1 computes a phase-by-phase 
schedule which is an optimal schedule up to a factor | . 



Proof. Let a = Since n < [|pj we have a > 2. Algorithm 1 returns 

t(l) + t(a). Note that t(l) is obviously a lower bound on the makespan of an 
optimal schedule. 

If t(a) < |f(l) we get 

f (1) + t(a) ^ 6 
*( 1) " 5‘ 



So let t(a) > |t(l). 

First case: There exist no more than p jobs each being executed on at most a 
processors. 

Using the speed-up property yields 



Ti — T) 

Topt > ~{p ■ t(l) + (n — p)(a + 1) • t(a + 1)) = f(l) -I • (a + 1) • t(a + 1) 

p p 



— t(l) + 



n — p 
P 



P 



n — p 



+ 1 



AtWA— t(a). 



ti^a -\- 1) > t(l) ~\~ t(a + 1) 
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Due to monotony of the time function we get 

t(l) + t(a) ^ t(l) + t(a) ^ t(l) + f(l) ^2 6 

*(i) + 4r f (“) “ M+W) ~ M+W) ~ i+| = 5' 

Second case: There exist at least p+ 1 jobs each being executed on at most a 
processors. 

We only have to consider dominating schedules for which t(a) is the smallest 
addend which appears in the makespan. Since t(a) > |f(l), we have Topt > 
6 t(a) > t(l) + t(a) for all makespans of a schedule with more than 5 addends. If 
we have 5 addends then Topt > 5 t(a) > |f(l) + t(a), and we get 

t(l) + t(a) < 1 + § = 6 

+ l + l 5‘ 

Hence we only have to consider makespans with at most 4 addends. The dom- 
inating schedules are those corresponding to the makespans t( 1) + t(a), t( 2) + 
t(3) + t(a), t(2) + 1( 5) + 2 t(a) for a > 4, and 2t(3) + 2 t(a) for a > 3. 
i(l) + t(a): 

We immediately get 

t(l)+t(a) = 1 
t(l)+t(a) 

t( 2) + f(3) + f(a): 

For a < 2 we immediately get the approximation factor 1. For a = 3 we have 

*(l) + *(3) < *(l) + *(3) < Ijtl < 8 

i(2) + 2i(3) S' §i(l) + 2t(3) “ \ + l ~ T 

For a > 4 we get 

t(l) + t(a) ^ t(l) + t(a) ^ t(l) + |t(l) ^ 6 30 6 

t(2)+t(3) + t(a) ~ (i + |)t(l)+t(a) < |t(l) + |t(l) “ 5 ' 31 < 5‘ 

t(2) + f(5) + 2£(a): 

We have 

f(l) + t(a) ^ t(l) + t{a) ^ t(l) + gt(l) 6 10 6 

t(2) + i(5) + 2t(a) - (§ + ±)t(l) + 2t(a) < £i(l) + §i(l) = 5 ' IT < 5' 

2t(3) + 2t(a): 

It follows that 

t(l) + t(a) ^ f(l) + t(a) ^ t(l) + |t(l) 6 15 ^ 6 

2t(3) + 2 t(a) ~ §t(l)+2t(a) < §t(l) + §t(l) = 5 ' 16 < 5' 



This completes the proof. 



□ 
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6 An ^-Approximation Algorithm 

In the previous section we proved that there exists a 0(log(n))-time algorithm 
with approximation factor | . We now show that if the speed-up is good enough 
and p is large enough, then an optimal plrase-by-plrase schedule approximates 
an optimal schedule up to a factor 1 + e. Using r = ^ we denote 

{ k k 

22 yrfW 22~ - r 

i= 1 i—1 

Note that T p is a lower bound on the makespan of an optimal schedule (see 
Section 5). The following lemma is the key to our algorithm. 

Lemma 6. For 1 < k < p we have T p > 71- — t(k). 

Proof. Denote Y^i=k+i \ = I + ^ with a € No and 0 < b < Furthermore, let 



Vi if i < k 

Uk + a + 1 if i = k 



This implies 

E z i \ ' Vi . Uk + O, + 1 V— ' Hi ^^hk+l<i<p \ \ ' Vi . 

7= £ 7 + — ^7 + T^ = 2f7 i ’'- 

l<k<p l<i<k— 1 2—1 l<z<p 

The speed-up property yields 

p k / p \ k 

t p = 22 &*(*) - 22 yit (*) + ( 22 7 ) k • = 22 Vit w + (f + b ) k ■ 

2=1 2=1 \2=/c+l / 2=1 

k k 

= — (a + 1 )t(fc) + (a + bk)t(k) = Zit(i) + (bk — l)t(fc) 

i=l »=1 

>T k - t(k). 

This completes the proof of the lemma. □ 

If t(i ■ j) < -t(j) , c constant, then the previous lemma yields an approximation 
algorithm. 

Theorem 4. Let t(i ■ j) < ° ■ f(j), c constant. Then the optimal phase-by-phase 
schedule is optimal up to a factor 1 + 2 



Proof. Let s > 0. We will show that the optimal plrase-by-phase schedule is 

3 

optimal up to a factor 1 + e if p > 4 • fg holds. 

n < £ ■ p: We choose the minimal j £ N with -(L- < n < ? . Hence t(i) is an upper 
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bound and t(i + 1) is a lower bound on the makespan of an optimal schedule. 
This yields 

t(i) < t(i) , 1 

t(i + l ) “ 4i '*(*) *’ 

and we are done. 

n> £ ■ p: The coefficients y\ , . . . , y v in the definition of T/ define a phase- by- 
phase schedule. With this schedule we can not handle all jobs. However, due to 
speed-up property we have at most i — 1 phases using i processors. So we have 
at most ]Ci<i<fc(® — 1) = ~ l)fc phases using more than one processor. If 

p > |fc 3 , then the remaining jobs can be executed in exactly one more phase 
using k processors. For an optimal plrase-by-phase schedule with makespan Tpp$ 
using at most k processors we have Tpp$ < T/ + t(k). We choose the minimal 
k £ N with k > and get 

Tpps ^ Tfc + t(k) < Tp + 2f(fc) < Tp + 

< Tp + £T 2 t(l) < Tp + £ • — t(l) < 

V 

This completes the proof. □ 

Remark 1. 

1. Combining Theorems 1, 2 and 4 leads to an £-approximation algorithm. Use 

3 

the optimal phase-by-phase algorithm if p > 4 • ^ holds and the optimal 
algorithm from Theorem 1 otherwise. 

2. Theorem 4 can be generalized to other time functions fulfilling t(i) 0 for 
i — > oo. 
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Abstract. We study the problem of on-line scheduling on two uniformly 
related machines where the on-line algorithm has resources different from 
those of the off-line algorithm. We consider three versions of this problem: 
preemptive semi-online, non-preemptive on-line and preemptive on-line 
scheduling. For all these cases we design algorithms with best possible 
competitive ratios as a function of the machine speeds. 

1 Introduction 

The problem: We consider scheduling on two uniformly related machines with 
resource augmentation. Jobs arrive one by one, and each job has to be assigned 
before the next job arrives. 

We consider three different versions: preemptive semi-online scheduling, non- 
preemptive online scheduling and preemptive online scheduling, where semi- 
online means that jobs must arrive in a decreasing order of size, and preemptive 
means that a job may be cut into a few pieces and scheduled on both machines, 
but two parts of the same job may not be scheduled concurrently on two dif- 
ferent machines. This is allowed to both the on-line algorithm and the off-line 
algorithm. 

The measure: A schedule is measured by makespan , i.e. the maximum comple- 
tion time of any machine. An on-line algorithm is measured by its competitive 
ratio , which is defined C = sup -{ \ , where cr is a list of jobs and ALG(a) 

and OPT(a) are the makespans of the schedules created by the on-line algorithm 
and the optimal off-line algorithm for that list of jobs. 

Preliminaries: We consider cases where the resources may be augmented, i.e., 
the optimal off-line algorithm has two uniformly related machines with possibly 
different speeds, which may be faster or slower than the speeds of the machines of 
the on-line algorithm. In the general case, this means that the on-line algorithm 
has a slow machine with speed a and a fast machine with speed b, and the off-line 
algorithm has a slow machine with speed A and a fast machine with speed B. 

* Research supported in part by the Israel Science Foundation (grant no. 250/01). 
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However, with no loss of generality we may assume the on-line algorithm has a 
slow machine with speed 1/s (s > 1) and a fast machine with speed 1, and the 
off-line algorithm has a slow machine with speed 1 / q (q > 1) and a fast machine 
with speed 1. Note that a machine of speed 1 / z is faster for smaller values of 2. 

This assumption can be made since a schedule with makespan x on machines 
with speeds a,b will have a makespan x/a on machines with speeds 1, b/a. 
In particular this is true for the optimal on-line schedule, thus it is possible to 
assume, without loss of generality, that the on-line algorithm has speeds 1, 1/s = 
a/b. Similarly, it is possible to assume, without loss of generality, that the off-line 
algorithm has speeds 1 ,q = A/B. We use the parameters q and s throughout 
the paper. 

The load of a job of processing time p on a machine of speed 1/z is pz. We 
define load of a machine as the total load of the jobs (or parts of jobs) that are 
scheduled on that machine plus the sum of times when the machine is idle. The 
weight of a machine is the sum of the processing times of jobs (or part of jobs) 
that are scheduled on that machine. 

We slightly abuse the notation by identifying jobs with their sizes (processing 
times) 

Our results: In this paper we find the exact competitive ratio functions for all 
three versions we consider, those are functions of s and q. For the preemptive 
cases, we show they are valid for both a deterministic version and a randomized 
version. We give deterministic algorithms which achieve these competitive ratios. 

We show that the competitive ratio of preemptive semi-online scheduling is 
1 for {q < 3, s < or for {q > 3 ,s < pp}, 29+3 sq f° r < s < 2} and 

{q < 3,s > 2}, and ^+ 1+2 sq for {q > 3,s > pp}. For the last two cases we use 
idle time and prove that it is necessary. 

Moreover, we show that the competitive ratio of non-preemptive online 
scheduling is min | , pp | and the competitive ratio of preemptive on- 
line scheduling is ■ 

Previous work 

On-line scheduling: Scheduling on identical and uniformly related was studied 
since 1966 when Graham introduced his greedy algorithm “List Scheduling” for 
identical machines [20] . The competitive ratio of that algorithm is 2—1/m. Since 
then the was a sequence of improvements on the performance of algorithms [4, 
25, 1] and a sequence of lower bounds [5, 1, 19], and the best current results are 
an algorithm of competitive ratio 1.9201, designed by Fleischer and Wahl [16] 
and a lower bound of 1.88 given by Rudin [28]. For uniformly related machines 
there exist constant competitive algorithms, see [2,6]. The lowest upper bound 
known for the competitive ratio is 5.828 whereas the lower bound is 2.438. For 
two and three identical machines, [15] showed that the greedy algorithm has 
the best possible competitive ratio. The tight result for two uniformly related 
machines (without resource augmentation) is folklore and is given in [13]. The 
competitive ratio is min{ ppy , 1 + for this problem, achieved by a greedy 
algorithm which assigns a job to the machine which would finish it first. 
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Semi-online scheduling: Graham [21] also analyzed the greedy algorithm on 
identical machines for the case where sizes (processing times) of jobs are non- 
increasing getting the result 4/3— l/(3m). The results for two and three machines 
are 7/6 and 11/9. Seiden, Sgall and Woeginger [30] showed lower bounds of 7/6 
and 1.18046 for two and three machines respectively. Note that the bound for two 
machines is tight, which means that the greedy algorithm has the best possible 
competitive ratio for the problem. 

Most of the study for semi-online scheduling on non-identical machines in- 
volves a study of the greedy algorithm. For two machines, Mireault, Orlin and 
Volrra [27] gave a complete analysis of the greedy algorithm as a function of the 
speed ratio. They show that the interval q > 1 is partitioned into nine intervals, 
and introduced a function which gives the competitive ratio in each interval. 
The reference [18] shows that for any speed ratio, the performance ratio of the 
greedy algorithm is at most / (1 + vT7) ss 1.28. This was generalized in [11], In 
that paper, a complete analysis of the best competitive ratio as a function of the 
speed ratio was given. Here there are already fifteen intervals. In most intervals 
the greedy algorithm turns out to be the best algorithm, but there are several 
intervals where other algorithms are designed, and were shown to perform bet- 
ter and to have the best possible competitive ratios. For a general setting of m 
uniformly related machines, Friesen [17] showed that the overall approximation 
ratio of the greedy algorithm is between 1.52 and |. Dobson [9] claimed to im- 
prove the upper bound to ~ 1.58. Unfortunately, his proof does not seem to 
be complete. 

On-line preemptive scheduling: There are several tight results for preemptive 

on-line scheduling, in all cases the lower bounds hold also for randomized algo- 
rithms. Chen, Van Vliet and Woeginger gave a preemptive optimal algorithm 
and a matching lower bound for identical machines [8] . The competitive ratio of 
the algorithm is m m /{m m — ( m — l) m ). Seiden [29] also gave such an optimal 
algorithm. His algorithm, has a lower makespan than the one in [8] in many 
cases, however (naturally) it has the same competitive ratio. A lower bound of 
the same value was given independently by Sgall [31]. For two uniformly related 
machines , the tight competitive ratio is known to be ( q + l) 2 / (q 2 + q + 1), given 
in [13] and independently in [33]. Those results are extended for a class of m 
uniformly related machines with non- decreasing speed ratios in [10]. The tight 
bound in that case is a function of all the speeds. A lower bound of 2 on the 
competitive ratio was given already in [14] . 

Semi-online preemptive scheduling: Seiden, Sgall and Woeginger [30] studied 

semi-online preemptive scheduling on identical machines. They showed that the 
most difficult case among non-increasing sequences is a sequence of unit size 
jobs. They gave a complete solution, where the competitive ratio is a function 
of the number to of machines. 

The problem of preemptive semi-online scheduling on two uniformly related 
machines without resource augmentation was studied by Epstein and Favrholdt 
[12], who proved a competitive ratio of max{^^, 2 q^+q+ 1 } w h ere l/<? is the 
speed of the slow machine. They also proved that idle time is necessary for q > 2. 
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Resource augmentation: This is a generalization of competitive analysis. Re- 

source augmentation was introduced by Kalyanasundaram and Pruhs [24]. They 
studied certain scheduling problems which are known to have unbounded com- 
petitive ratio. They showed that it is possible to attain a good competitive ratio 
if the machines of the on-line algorithm are slightly faster than the machines of 
the off-line algorithm. 

Resource augmentation has been applied to a number of problems. It was 
already used in the paper where the competitive ratio was introduced [32]. In 
that paper, the performance of paging algorithms was studied, where they have 
a larger cache memory than that of the optimal off-line algorithm. In several 
machine scheduling and load balancing problems [7, 23, 26, 3], the effect of adding 
more or faster machines has been studied. 



2 Preemptive Semi-online Scheduling 



In this section we consider semi-online preemptive scheduling on two uniformly 
related machines. Recall that jobs arrive sorted in non-increasing order of pro- 
cessing times. In the current section c(g, s) denotes the optimal competitive ratio 
for this problem. We prove the following theorem. 



c{q,s) = < 



Theorem 1. The optimal competitive ratio of deterministic or randomized semi- 
online preemptive algorithm is: 

1 for q < 3 and s < ^ 
for q > 3 and s < 
for q < 3 and yy < s < 2 
for q < 3 and s > 2 
for q > 3 and s > lip 



1 

3s(q+l) 

2q-\-3sq 



Zq+3sq 

3s(q-\-l) 

2q-\-3sq 

2s(g+l) 



q+l+2sq 



In the last two cases , only an algorithm which uses idle time can achieve this 
competitive ratio. 



See Figure 1 for the five cases. 




Fig. 1 . The partition to areas according to the five cases 
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We first prove the lower bound and then design the algorithms. The only 
algorithms which use idle time are for the cases where it is necessary. Note that 
due to the resource augmentation, there are two areas where the slow machine 
of the on-line algorithm is relatively fast and therefore an “optimal” algorithm 
can be achieved. In those cases, adding speed to the on-line algorithm allows us 
to overcome the on-line restriction. 



2.1 Lower Bound 

We start with two theorems that appeared in the literature and are useful for 
proving our results. 

The first theorem deals with the optimal preemptive schedule. Unlike the 
non-preemptive case there is a simple formula for calculating it. The theorem is 
a special case of a theorem given in [22] . 

Theorem 2. Given a set of jobs of total weight W, the largest of which has size 
l, the makespan of the optimal offline algorithm (whose machine speeds are 1 
and l/q) is max{£, }• 

The second theorem gives a “recipe” of computing lower bounds for random- 
ized on-line and semi-online algorithms. Similar theorems were given in [14, 10, 
12] and are adaptations of a theorem from [31]. 

Theorem 3. Consider a sequence of at least two jobs, where J„_i, J n are the 
last two jobs. Let OPT(Ji) be the preemptive optimal off-line cost after the arrival 
of Ji. The competitive ratio of any preemptive randomized on-line algorithm is 
at least OPT (j _ 1 ) +s opt(j ) ■ 

Proof. Fix a sequence of random bits used by the algorithm. Let ONL(J.() be 
the makespan of the preemptive on-line algorithm after scheduling Jj. Let Tj = 
ONL(J n ), and let T 2 be the last time which the on-line algorithm has a job 
scheduled on both machines. Since a job cannot be processed concurrently on 
both machines, we get ONL(J n _i) > T 2 . 

Suppose c is the competitive ratio of the on-line algorithm. Let P be the 
total size of all the jobs. Since only one machine can be non-idle during the time 
period between T 2 and Tj, we get P < (1 + \)T 2 + (Tj — Tf) = Tj + Taking 
the expectation we get 

P < U[Tj] + < E[ONL(J n )\ + E l° NL y - 1 )! < cOPT(J n ) + c ° p U J - 1 ) . 

Hence, c > OPT(J _ 1 I )f s oPT(j ) • 

Note that we did not use the assumption that the jobs arrive at a decreasing 
order of size, thus this theorem applies for the on-line case as well. 

Lemma 1. The competitive ratio of any on-line preemptive randomized algo- 
rithm is at least: 

max In - 2s(q+1) - 3s( « +1) ll 

max q+l+ 2 sq >'2 2g+3 sq ’ L J ' 
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Proof. Consider three sequences consisting of 1, 2 and 3 unit size jobs. For one 
job, no algorithm can do better than scheduling all of it on the fast machine at 
time 0, achieving a competitive ratio of 1. For sequences of 2 and 3 jobs, we get 
the lower bound by applying the above theorem. 

Lemma 2. For the cases {q < 3 , s > 2} and {q > 3, s > an on-line 

algorithm, which does not use idle time cannot achieve a competitive ratio of r 2 
and r\, respectively. 



Proof. Consider a sequence of two unit jobs. Set i = 1 for q > 3, i = 2 for q < 3. 
Under our assumptions, the first job cannot be assigned to the slow machine, 
since in this case OPT = 1. This would cause the competitive ratio to become 



s, which breaches our competitive ratio, since 

ri - s = - s = < 0 since s > ^ in this case, and 

r- a — 3s (l + X ) _ o _ n q+3 — 3sq n • q„_ v. C „„ A ^ a mVnno f 



r 2 — s = 2 q+ 3 sq ~ s ~ s q( 2 + 3 s) < 0 since 3 sq > 6 and g+3 < 6. Thus the first job 
must be scheduled on the fast machine at time 0. For two jobs, OPT = , thus 

the second job must be scheduled after time 1 and no later than time -r* on 
the fast machine, which means at least 1 — (^qr[U — 1) must be scheduled on the 

slow machine. For both these cases, this schedule is illegal since s ^2 — r^j > 1 
due to the following. 

For n, 2s-l-s^jn = glflflq > 0 for s > and for r 2 , 2s-l- s^ j 2 r r 2 
= ^5^ > 0 f° r s > 2. This implies that the second job is scheduled concurrently 
on both machines. 



2.2 Algorithms without Idle Time 

In this section we present algorithms for the three cases in which the creation 
of idle time in the schedules can be avoided. By the previous section, only three 
such cases may exist: {s < q < 3}, < s < 2, q < 3} and {s < q > 3}. 

Lemma 3. For max{l, -^} < s < 2,q < 3, there exists an on-line algorithm of 
competitive ratio r 2 = 2^+3^ which does not use idle time. 

Proof. The first job J\ is scheduled on the fast machine, without loss of gener- 
ality, we may assume it has size 1. 

As long as the total size of jobs does not exceed 1 + i, OPT = 1. Thus, we 
can use the interval 1 to r 2 on the fast machine without violating the competitive 
ratio. For the next jobs we use that interval first, and when it is full we go on 
to the time interval [0, s ^1 + ^ — r 2 ^] on the slow machine. When these two 
intervals are filled, the total size of the jobs scheduled is 1 + K At this point, 
some job may be only partially assigned. Denote this job J p . Since (1+ ^ — r 2 )s = 

(l + - — 3s ) s = 2s ffff- , = < I • f < 1, the load on the slow 

y q 2q+3sq J q(2+3s) 2+3s q — 8 1 — ’ 

machine does not exceed 1, hence even if until this point some other job was 
split between the two machines, its two parts do not overlap in time. 
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In this situation, the ratio of the loads is 3:2. From now on, we keep this ratio, 
so that the fast machine is always more loaded. Any arriving job of size x is split 
into two pieces of size (fast machine) and (slow machine). For the 
case that the total sum of sizes P is large enough (P > 1 + ^), OPT = The 
on-line load on the fast machine (which is larger) is and the competitive 

ratio is r 2 - This completes the description of the algorithm. To complete the 
proof we need to show the following: 

a. The second part of J p is scheduled properly. 

b. Any future job J is scheduled properly. 

Proof of a — case 1: J p was the second job to arrive: Since it is scheduled 
on the fast machine only after time 1, we need only to show that on the slow 
machine, it will be completed no later than time 1. Let x be the size of the 
second part of J p . We need to schedule 2 of the job (which consumes 
units of time on the slow machine). The size of the second job is at most 1, and 
1 of it was scheduled, so x < 1 — P Thus, after scheduling J pi we get that the 

load on the slow machine isgf^-^ + ^^gff^l+^+a:)^^^ 1 . 

Proof of a — case 2: J p was the third job to arrive or later: If J p does not 
have its first part scheduled on the fast machine, then this case is similar to a case 
here the first part and the second part are two different jobs, whose correctness 
is proven in the next section. Otherwise, J p < r 2 — 1, since it must be smaller 
than the second job, and the second job must be scheduled entirely on the first 
machine by the definition of the algorithm. In this case, scheduling it on the slow 
machine will cause a load of no more than (?’2 — l)s = ^q+Ssq — 3 9 + 3 s S — 1 (for 
s < 2). Thus, the part scheduled on the slow machine does not intersect J 2 , and 
thus it does not intersect J 3 . 

Proof of b: We need to show that the interval on the slow machine is enough 
to schedule gfqhp Let P be the total size of previous jobs. Then gyr^P is the 
load of the fast machine and gypgP is the load of the slow machine just before 
scheduling J. The extra load that J causes is Hence we need gppg > iS+jr 
This clearly holds since x < p because J is at least the third job. 

The proofs on the next two lemmas are omitted. 

Lemma 4. For s < pp,g > 3, there exists an on-line algorithm of competitive 
ratio 1 which does not use idle time. 

Lemma 5. For s < ~^,q < 3, there exists an on-line algorithm of competitive 
ratio 1 which does not use idle time. 

2.3 Algorithms with Idle Time 

In this section we present algorithms for cases which were previously proven to 
require idle time to reach our desired competitive ratio. Two such cases exist: 
{q < 3, s > 2}, for which we will prove a competitive ratio of 7*2 = 2 ^+^ > and 
{q > 3, s > Pp} for which we will prove a competitive ratio of r\ = • 
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Lemma 6. For every q there exists an on-line preemptive algorithm with com- 
petitive ratio of max { , tgg } • 

Proof. We define the algorithm. Without loss of generality, we assume the first 
job has size 1. Let r = if Q < 3, and q+i+ 2 sq °th erw i se - The first job is 

cut into two pieces: one of size which is scheduled on the slow machine until 
time r, and the other of size j+y which is scheduled on the fast machine starting 
from time 0. From now on, future jobs are scheduled on the first (possibly idle) 
time on the machine they are scheduled on. If a job is scheduled on the slow 
machine and there is no idle time left, it will be scheduled after time r. 

As in previous algorithms, as long as the total size of the jobs does not exceed 
1 + 1. these jobs are scheduled on the fast machine until time r, and then on 
the slow machine. When the total size of the jobs is 1 + 1, there is still idle 
time on the slow machine - by the definition of the algorithm, the idle time ends 
when the total size of the jobs is r(l + 1), which is greater than 1 + 1, Since 

r(l + 1) - 1 - 1 > min{(^T_, > 0 for q > 1. The loads on the 

machines, not including idle time, is r on the fast machine and s ^1 + 1 — r'j on 
the slow machine. The ratio of the loads is § if q < 3, and if q > 3. Since 

even after the first job OPT > 1, the competitive ratio is maintained up to this 
point . 

We continue by keeping the ratio between the loads. For q < 3 this means 
cutting any job of size x into two pieces, one of size , which is assigned to 
the first violating time on the fast machine, and the other of size 3 j+ 2 , which is 
assigned to the first available time on the slow machine. For q > 3 this means 
cutting any job of size x into two pieces, one of size q +i^ sq which is assigned to 

the first available time on the fast machine, and the other of size which 

is assigned to the first available time on the slow machine. Note that in both 
cases it is possible for the assignment of some job to take up the idle time on 
the slow machine, in which case the piece assigned to the slow machine is cut 
into two pieces, and is continued after time r. 

Again we need to prove the leftover on the job for which the total size reached 
1 + Ms scheduled properly, and that any future job J are assigned properly. Since 
the free time before time r on both machines is disjoint after scheduling the first 
job, there is no difference between a job and the leftover of a job. 

Case A: q < 3: 

Let P be the total size of previous jobs (such that P > 1 + |). Let x be the 
size of job J. Then yy|_p is the load of the fast machine and ^jb^P is the load 
of the slow machine before scheduling J. Scheduling J will cause an extra load 
of on the slow machine. Hence, we need to show that ■ This 

clearly holds if is at least the third job, which implies x < ^. Otherwise, J is a 
leftover and then x < 1 — A For q < 3, this implies that or<l— ^<|(1+|) 
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Case B: q > 3: 



Let P be the total size of previous jobs (such that P > 1+ 1). Let x be the size of 

job J. Then 2q ^ s q+1 P is the load of the fast machine and 2 qs+q + 1 ^ the l° a d 
of the slow machine before scheduling J. Scheduling J will cause an extra load of 
q+i+ 2 qs on the S ^ 0W mac hine. Hence, we need to show that < ^j ^ 2qs , 

which is implied by x < 2 ^yP. If J is the third job or later then x < P < 



Otherwise, J is a leftover and x < 1 — 

’ — 9 

rvt 1 t — (g — 1 )(g + 1 ) <• 9-1 9 + 1 <• 9-1 

- q q 2 + q ~ 9+1 f ' “ 9+1 



For q > 3, this implies 
P. 



3 Non-preemptive On-Line Scheduling 

In this section we study the problem of deterministic non-preemptive scheduling 
with respect to makespan on two machines. There is no restriction on the arriving 
jobs. Since preemption is not allowed, we can assume that the algorithms cannot 
use idle time, as idle time does not improve the situation. In the current section 
c{q , s) denotes the optimal competitive ratio for this problem. 

We prove the following theorem. 

Theorem 4. The optimal competitive ratio of deterministic on-line non-pre- 
emptive algorithm is c(q , s) = for s < and c(q, s) = for s > 

Note that 1 < c(q, s) < 2. 



3.1 Upper Bound 



We define the algorithm LIST. Schedule each job on the machine where it will 
be completed first. 



Lemma 7. LIST has a competitive ratio 



of min | 



(1+29)s 

(■8+1)9 



T) 



Proof. Consider the time just before the arrival of the job which determines the 
makespan. Let x be the weight of the fast machine of the on-line algorithm, 
and let y be the weight on the slow machine of the on-line algorithm at this 
point. Let £ be the size of the next job. Denote OPT the makespan of the 
schedule created by the optimal off-line algorithm, ALG the makespan of the 
schedule created by LIST, and c = pppp- We get OPT > = q ^ x +v+ l ' > 

and OPT > l. The on-line algorithm will assign the next job to the machine 
where it will finish first. Thus c{ONL) < x + l and c(ONL) < {y + l)s. We 
have (s + l)c(ONL) < s( x + y +21) < s(x + y + l) + si < OPT + sOPT. 
Thus c{ONL) < OPT ■ -> c < Moreover, LIST is definitely 

no worse than scheduling every job on the fast machine - hence, c < ®±i. Note 

that c = for every s > . This value varies between 1 and 2, and is equal 

to ^ w 1.618 for s=q. 
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3.2 Lower Bound 

We prove that no deterministic on-line algorithm can perform better than LIST. 

Lemma 8 . No deterministic on-line algorithm can have a better competitive 
ratio than c = min | , Ttl j _ 

Proof. First, we prove the lemma for s > In this case, c = 2 ±_ . We use a 
sequence of two jobs: 1, q. Denote the on-line algorithm ALG, and an optimal 
off-line algorithm OPT. If ALG schedules the first job on the slow machine, it 
gets a competitive ratio of s > 2 ±_. Thus in order to maintain a competitive 
ratio of less than 2 ±I ^ must schedule the first job on the fast machine. In this 
case, it performs no better then <7 + 1 on the second job since scheduling it on the 
fast machine gives a makespan of q + 1 , and scheduling it on the slow machine 
gives a makespan of qs > q + 1, while OPT easily achieves a makespan of q. 

For s < , Set a = q+1 ~ qs . Note that since s < a > 0. We start with 

presenting the algorithm a job of size 1. We have the following cases: 

Case 1: The first job is assigned to the slow machine. 

Set Tj = (1 + i) 1 . Let j be an integer number such that Tj < 1 + q , T ) +1 > 1 + q. 
Such a number j must exist since To = 1 and lim T n = oo. In this case, we 

n— >oo 

proceed with the following sequence: bo = 1 (The job which was presented at 
first), bi = ^ ,bi = ^(1 + ^)* _1 for all i > 2. Note that T,; = ^ b n , for all 

n— 0 

i > 0. We give jobs from this sequence until some job b t is scheduled on the fast 
machine, or until job bt = 67+1 is scheduled (on one of the machines), i.e. in 
total j + 2 jobs are scheduled. The following cases may occur: 

Case la: t < j + 1 and there is at least one job on each machine: 

After the first time the algorithm scheduled a job on the fast machine, we present 
a job of size qT t . The off-line algorithm achieves a makespan of qT t (scheduling 
all but the last job on the slow machine, and the last job on the fast machine). 
The on-line algorithm can either schedule that job on the slow machine or to the 
fast machine. If it is scheduled on the slow machine, we get a competitive ratio 

f sT -! + sqT _ stl+i) - 1 +sq(l+i) _ s I s(g+l-gs+ 9 s+g) _ (l+2g)s 

° qT - q( 1+A) ~ (1 +i)<? + “ q(s+ 1) - (s+1 )q 

Otherwise, it is scheduled on the fast machine, and the competitive ratio is 

b +qT -(1+-) 1 +Q'(1+-) 1 , 1 1 I 1 qs+s—q , 

qT - <?(l+i) “ 1 + q(l+a) - 1 + g( 1 + i±_^) - q(s+ 1 ) + 1 “ 

2 ( 7 g +5 s(l+2q) 

q(s+l) ~ (s+l)q ’ 

Case lb: t = j + 1 and all jobs are on one machine: 

We will show that either after j + 1 jobs or after j + 2 jobs, the competitive ratio 
is at least c. We denote by OPTk, ALGk the makespans after scheduling job bk 
of the optimal algorithm and the on-line algorithm, respectively. In these two 
cases ( k = j,j + 1), OPT can put the first job on the slow machine and all the 
next jobs on the fast machine, achieving a makespan of OPTj < q OPTj + i < 
Tj — 1. The on-line algorithm places all jobs on the slow machine, achieving a 
makespan ALGj = sTj,ALGj + ± = sTj + 1 . If ^- > s ^^i) the proof is complete 
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since > c. Otherwise, Tj < and Tj + 1 = Tj( 1 + L) < 1 s ^ 2 ] 9 (1 + 

T) = • Thus, the competitive ratio is at least x T + x-i = s (l + T ^- 1 ) — 

s ( 1+ -+±+ — l) = S (T+T^- 

Case 2: The first job is scheduled on the fast machine. 

Set Si = (1 + a) 1 . Let j be an integer number such that Sj < l+q,S j+1 > 1 + q. 
Such a value j must exist since S'o = 1 and lim S n = oo. In this case, we 

n—>oo 

proceed with the following sequence: do = 1 (The job which was presented at 

i 

first), ai = a, a,: = a(l + a ) 1-1 for all i > 2. Note that Si = a n ■ We give 

n — 0 

jobs from this sequence until some job is scheduled on the slow machine, or until 
j + 2 jobs are scheduled. Let k be the index of the last job. The following cases 
may occur: 



Case 2a: k < j + 1 and there is at least one job on each machine 
After the first time the algorithm scheduled a job on the fast machine, we give the 
on-line algorithm a job of size qS^ . The off-line algorithm achieves a makespan 
of qSk (scheduling all but the last job on the slow machine, and the last job to 
the fast machine). An on-line algorithm can either schedule the last job on the 
fast machine or on the slow machine. 



If it is scheduled on the fast machine, the competitive ratio is at least 

S — 1 (l+q) 1 +4(1 +oQ i I 1 -i I 1 qs+s—q 

qS ~ ?(i+q) — g(i+a) — ^(l- p+p yT j ~ q(s+ 1) 

2 gs+s s(l+ 2 q) 

40 +}) ~ 0 + 1)4 

If J is scheduled on the slow machine, the competitive ratio is at least 

sa +sqS 8«(l+a) ~ 1 +sg(l+q) S a i s+qs—qs 2 , „ 2q+l 

qS ~ 4(1 +a) — (l+a)4 ' — (5+1)4 ' ~ * (5+1)4 



Case 2b: k = j + 1 and all jobs are on one machine 

We show that after either j+1 jobs or j+ 2 jobs, the competitive ratio is at least c. 
We denote by OPT tl ALG t the makespans after scheduling job a t ( t = k,k + 1) 
of the optimal algorithm and the on-line algorithm, respectively. In this case, 
OPT can put the first job on its slow machine and all the next jobs on its 
fast machine, achieving a makespan of OPTj < < 7 , and OPTj + 1 < Sj+i — 1 . 
The on-line algorithm has scheduled the first j + 2 jobs on the fast machine, 
achieving a makespan ALGj = Sj, ALGj + 1 = Sj + ±. If ^ the proof 

is complete since gpp > c. Otherwise, Sj < and Sj+i = Sj ( 1 + a) < 



£(I+M(1 + a ) = (1 + 2 g) - 

s +1 



+1 



-1 



= 1 + 



+1 



-1 — 



> 1 + 



4+45 ■ 
1 

( 1+2 ) 



Thus, the competitive ratio is at least 
24+1 



-1 



(5+1)4 



4 Preemptive On-Line Scheduling 

We study the problem of deterministic and randomized scheduling with respect 
to makespan on two machines. Idle time is allowed but our algorithms do not 
use it. 
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In this section we let r(s, q) denote the competitive ratio function. We prove 
the following theorem: 

Theorem 5. The optimal competitive ratio of any deterministic or randomized 
on-line preemptive algorithm is r(s, q) = . 

Note that for all s and q, 1 < r < 2 holds. 



4.1 Lower Bound 



Lemma 9. No deterministic or randomized on-line algorithm can achieve a 
competitive ratio better than r = . 



Proof. We get this from theorem 1 stated in section 2, by using a sequence of 
sand with volume 1, followed by a job with size q. The optimal algorithm achieves 
a makespan of yyrr on the sand. This is done by assigning an amount of yyrr 



of the sand to the fast machine and the rest which is an amount of 



of 



it to the slow machine. A makespan of q on the entire sequence is achieved by 
assigning all the sand to the slow machine and the other job to the fast machine. 

Ps _ (g+jjg _ (<j+l) 2 s _ (<?+i) 2 s 

<3(l+s+qs) 



We eet c > — — — w +1 ) s — 

vve ger c p OPT (j _ 1 ) +s opt(j ) jpT+gS — 9+59(9+1) - 



4.2 Upper Bound 

We use the following notation. A new (arriving) job has size l. The makespan 
of the optimal off-line algorithm before scheduling t is OPT. The load of the 
fast machine of the on-line algorithm before scheduling l is L t , and the load of 
the slow machine of the on-line algorithm before scheduling l is L 2 . The total 
weight of all the jobs before l is w = L\ + Analogously, L\ , L' 2 , ui' and OPT ' 
are the load on the fast machine of the on-line algorithm, the load on the slow 
machine of the on-line algorithm and the total weight of the presented jobs and 
the makespan of the optimal off-line algorithm after scheduling t. 

n 

Lemma 10. For a given sequence of job ai, a 2 , . . . , a n , Let w = ^2 a n . If there 

2 = 1 

exists some i for which a,; > q(w — af), then it is unique and the competitive ratio 
of the optimal off-line algorithm is a P Otherwise, the makespan of the optimal 
off-line algorithm is w/(l + 1 /q). 

Proof. If a,; exists then it must be unique since a* > qw/(q + 1) > w/2. The 
bound on the makespan of the optimal off-line algorithm follows from Theorem 2. 

We define algorithm F: Whenever a new job l arrives, schedule a part as large as 
possible of it within the time interval [Li,rOPT'] on the fast machine starting 
from A, , and process the rest (if necessary) on the slow machine starting from 
L 2 • 

The proofs on the next three lemmas are omitted. 
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At any state of algorithm F, L 2 < 

For a given w, t the smallest available space for £ is achieved when 
At every state of algorithm F, the assignment is legal. 

5 Conclusion 

We gave tight bounds for three models. We did not study the on-line non- 
preemptive model. In [11] where this model is studied, the interval [1, 00 ) of 
the speed ratio between the two machines is split into 15 intervals, each of which 
stands for a totally different case. The proofs are very technical and therefore it 
seems that an extension to the resource augmented model would be too compli- 
cated. There are certainly many other two machine models which are interesting 
to extend. 
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Abstract. Scheduling precedence constrained jobs on identical parallel machines 
is a well investigated problem with many applications. AND/OR-networks con- 
stitute a useful generalization of standard precedence constraints where certain 
jobs can be executed as soon as at least one of their direct predecessors is com- 
pleted. For the problem of scheduling AND/OR-networks on parallel machines, 
we present a 2-approximation algorithm for the objective of minimizing the make- 
span. The main idea of the algorithm is to transform the AND/OR constraints into 
standard constraints. For the objective of minimizing the total weighted comple- 
tion time on one machine, scheduling AND/OR-networks is as hard to approx- 
imate as Label Cover. We show that list scheduling with shortest processing 
time rule is an O(yTT) -approximation for unit weights on one machine and an 
n-approximation for arbitrary weights. 



1 Introduction 

Scheduling precedence constrained jobs on identical parallel machines is a well investi- 
gated problem with many applications. A precedence constraint of the form (i-j) means 
that job j can be started only after the completion of job i. If there are several jobs i 
such that (i-j) is a precedence constraint, the job j can be started only after all of these 
jobs i are completed (AND-constraint). A natural and useful generalization of standard 
precedence constraints are AND/OR precedence constraints, represented by AND/OR- 
networks. In addition to standard constraints, they allow to specify that a node can begin 
execution as soon as at least one of its direct predecessors is completed (OR-constraint). 
AND/OR-networks arise in many applications, such as resource-constrained project 
scheduling [18] and assembly/disassembly sequencing [9]. In the latter application, a 
given product may have to be disassembled. Certain components can be removed only 
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after the removal of other components, leading to standard AND-constraints. However, 
it might also be possible to remove a component from one of several geometric di- 
rections, assuming that other components blocking this direction have been removed 
beforehand. This case naturally leads to OR-constraints. Therefore, the different pos- 
sibilities how to reach a component can be suitably modeled by AND/OR precedence 
constraints. 

If a given set of jobs with AND/OR precedence constraints has to be executed on a 
bounded number of identical machines (e.g., assembly lines, workers, etc.), interesting 
optimization problems arise. Natural objectives are the makespan (the completion time 
of the job that finishes last), the total completion time (the sum of the completion times 
of all jobs), and the total weighted completion time (where the completion time of each 
job is multiplied with the weight of the job). While these problems have been studied 
intensively for standard precedence constraints, only little is known about scheduling 
AND/OR-networks on one or several machines. 

1.1 Known Results 

Scheduling with the makespan objective is trivial on one machine if the jobs are not 
restricted at all, but also if standard or AND/OR precedence constraints are imposed 
among the jobs. The problem on identical parallel machines is NP-complete even with- 
out precedence constraints and with just two machines. Nevertheless, Graham’s list 
scheduling provides a simple 2-approximation for scheduling precedence constrained 
jobs on any number of parallel machines [10]. Gillies and Liu [7] present a 2-approxi- 
mation for scheduling jobs of an acyclic AND/OR-network. Their algorithm first trans- 
forms the AND/OR-network into a standard precedence graph and then applies Gra- 
ham’s list scheduling. 

Minimizing the total weighted completion time is a non-trivial problem even on a 
single machine. If there are no precedence constraints among the jobs, it is well known 
that scheduling the jobs according to Smith ’s rule [20] (in order of non-decreasing pro- 
cessing time over weight ratio) yields an optimal solution. For the total weighted com- 
pletion time of standard precedence constrained jobs on one machine, 2-approximation 
algorithms have been obtained with various techniques [12,3,2, 17], For the problem 
with identical parallel machines, a 4-approximation algorithm was presented in [19]. 
Unfortunately, it seems that the techniques used to obtain these algorithms resist any 
application to AND/OR precedence constraints. 

1.2 Our Results 

We study scheduling problems with AND/OR precedence constrained jobs on one 
or several machines. For the problem of minimizing the makespan on identical par- 
allel machines, we extend the algorithm of Gillies and Liu [7] to general feasible 
AND/OR-networks (which may contain cycles) in order to obtain a 2-approximation 
algorithm. Then we consider the problem of minimizing the total weighted completion 
time on a single machine. For the special case with unit weights (i.e., total completion 
time), we prove that list scheduling with shortest processing time rule is an 0(^/n)- 
approximation, where n is the number of jobs. This bound is tight. The case with ar- 
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bitrary weights seems to be harder to approximate. We observe that a reduction from 
Label Cover proposed by Goldwasser and Motwani in [8, 9] together with an im- 
proved inapproximability result for Label Cover by Dinur and Safra [4] shows that 
minimizing the total weighted completion time to within a factor of 2 logl 1/,oglog n 
of the optimum is NP-hard for any c < 1/2. We prove that list scheduling with shortest 
processing time rule is a simple n-approximation for the problem and that this bound is 
again tight. 



2 Preliminaries 

2.1 Problem Description 

A scheduling instance consists of a set of n jobs V = {1,2,..., n}. With each job 
j £ V we associate a (strictly) positive processing time pj and a non-negative weight 
u>j. In addition to the jobs, a number m of identical parallel machines will be given. The 
machines all run with the same speed and can process any job. Of course, at any point 
in time, each machine can process only one job and every job can only be processed 
by one machine. We will restrict to the non-preemptive case. Once a job is started on a 
certain machine, it will be finished on that machine without any interruption. 

The jobs are subject to two different classes of constraints. On one hand, standard 
precedence constraints represented by a directed acyclic graph G = ( V,E(G )) are 
given. An edge (i,j) £ E{G) represents the constraint that j has to wait for the com- 
pletion of i. In a feasible realization of such a problem, all jobs have to be executed 
in accordance to the partial order induced by G. Each job j £ V has to wait for the 
completion of all its predecessors in the partial order and thus will also be called an 
AND-node. On the other hand, we allow for precedence relations of the form that a job 
j has to wait for the completion of only one predecessor. Those restrictions cannot be 
captured by the classical precedence constraints described above. The model of stan- 
dard precedence constraints can be generalized by the introduction of a set W of waiting 
conditions. A waiting condition is an ordered pair w = ( X , j), where X C V is a set of 
jobs and j £ V\X is the waiting job. The waiting job / can be processed as soon as one 
of the jobs in X has been completed. The standard precedence constraints together with 
the waiting conditions can be represented by an AND/OR-network N = (V U W, E) in 
the following way: for every waiting condition w = (X, j) £ W, we introduce an OR- 
node, which we will denote by w again. For every x £ X, we introduce a directed edge 
(x, w). In addition, there is a directed edge (w,j) for the waiting job j. To model the 
required constraints correctly, we impose the rule that an OR-node w can be scheduled 
as soon as any of its predecessors x £ X is completed. An OR-node w £ W can be 
considered as a pure dummy node with processing time p w = 0 and weight lo w = 0. 
For convenience, we assume N to have a common start vertex, the source s, and a com- 
mon end vertex, the sink t, with p s = Pt = 0 and ui s = uj t = 0. For an illustration, an 
AND/OR-network is depicted in the left part of Fig. 1 , where node w 3 represents the 
waiting condition ({5, 6 }, 7), for example. 

In contrast to standard precedence constraints, an AND/OR-network may contain 
cycles and be feasible at the same time. Remember that the AND/OR-network N = 
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Fig. 1 . An AND/OR- network N on the left and a realization of N on the right side. AND-nodes 
are drawn as circles and OR-nodes as shaded squares. 



{V U W. E) contains the precedence digraph G as a subgraph. A realization of an 
AND/OR-network N is a partial order R = ( V. < r) which is an extension of G, i.e. 
i <f> j for each (i, j) £ E with i,j £ V, and 

for each w = ( X , j ) £ W, there exists x £ X with x <r j. 

A linear or total order L = ( V , <l), which is a realization of N, is called linear re- 
alization. In Fig. 1 a realization of the AND/OR-network N on the left is depicted on 
the right hand side. A total order is given by L = 1, 2, 3, 4, 5, 6, 7, for example. An 
AND/OR-network N is called feasible if it has a realization R. We refer to [13, 18] for 
additional characterizations of feasible AND/OR-networks and a linear time algorithm 
to check for feasibility. 

For a given instance ( N = (V U W, E),p,u>,m), a feasible schedule is a non- 
negative vector of start times S = (Si, . . . , S n ) for the jobs and waiting conditions 
such that 

1. Sj > max{S„ + p v | (v,j) £ E} for all j £ V ( AND-constraints ) 

2. S w > in in {S\, + p v | ( v,w ) £ E} for all w £ W ( OR-constraints ) 

3. j {j £ V | Sj < t < Sj + pj } | < m at any time f > 0 ( machine constraints ) 

For a given schedule S. the corresponding completion time vector C is defined by Cj = 
Sj + pj. The makespan of schedule S is denoted by C' max (S) = maxjgy Cj = C t . 
The total completion time of S is Cj and the total weighted completion time is 

Let us consider the problem without machine constraints, i.e. m = oo. By defi- 
nition, S = ( oo , . . . , oo) is a feasible schedule and if S = (Si, . . . , S n ) and S' = 
(S(, . . . , S' n ) are feasible schedules, then also the component wise minimal schedule 
S" = (minjSj ; Sj}, . . . , min{S„, S' n }). It follows that there exists a (unique) com- 
ponent wise minimal schedule, the so-called earliest start schedule ( ES ). We assume 
strictly positive job processing times, thus an earliest start schedule can be computed 
by a modification of Dijkstra’s shortest path algorithm in polynomial time. For further 
details we refer to [13]. Similar algorithms have been presented before in [14, 1, 18]. 
The earliest start schedule ES has the following property. For every job j £ V, the 
inequality ESj > ma x{ES v + p v \ (v,j) £ E} is fulfilled with equality for at least 
one edge (v,j) £ E as otherwise j could start earlier. For the same reason, for each 
OR-node w £ W, the inequality ES W > min{f?S , 1 , + p v \ ( v , w) £ Ej is also fulfilled 
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with equality for at least one edge. We call those edges (v, j) and ( v,w ), for which 
equality holds, tight. 

In the earliest start schedule, every job is started as early as possible without vio- 
lating any constraint. Therefore it also finishes as early as possible. For the problem 
without machine constraints it follows that the earliest start schedule ES is an optimal 
schedule for all three objective functions, the makespan, the total completion time, and 
the total weighted completion time. In the following we will always denote an optimal 
schedule for a given problem by S* = (5*, . . . , S*). 

To describe the problem under consideration quickly we will use the a \(3\ 7 -notation 
of Graham, Lawler, Lenstra, and Rinnooy Kan [11]. The first entry, a, specifies the ma- 
chine environment, (3 describes the job characteristics, where [3 = ao-prec denotes 
AND/OR precedence constraints, and 7 specifies the objective function. We refer to 
[11,16] for a detailed description. As an example, the problem of scheduling jobs with 
unit processing times and AND/OR precedence constraints on one machine with the ob- 
jective to minimize the total completion time is specified by 1 | ao-prec, pj = 1 | 'ffCj. 

2.2 List Scheduling 

One of the most basic and intuitive approaches to tackle a given scheduling problem is 
list scheduling. The idea is to order the jobs according to some priority rule, which gives 
an ordered list. The jobs are then scheduled according to this priority list. Whenever a 
machine is free, the first job in the list that has not been scheduled yet and is available 
at that time is assigned to be processed by the free machine. List scheduling can easily 
be implemented to run in polynomial time. According to [16], this algorithm has been 
presented first by Graham [10] and we will also refer to it as Graham’s list scheduling 
(GLS). 

In the literature, list scheduling is also called a priority-driven heuristic, according 
to the priority list that guides the order in which the jobs are scheduled. Priority driven 
heuristics never intentionally leave machines idle. A machine is only left idle if there 
are currently no jobs available. This means that for every job in the list, at least one 
predecessor is still in process. Recursing this argument gives a well known fact for 
standard precedence constraints stated in the next lemma. Let G — ( V. E) be a standard 
(acyclic) precedence graph with a strictly positive processing time vector p and u,v £ 
V two nodes such that there exists a w-v-path in G. The length of a longest path from u 
to v shall be denoted by f™ ax , where C™ ax = p v if u = v and /™ ax = p v + max{f“ ax | 
(x, v) £ E and there exists a w-a’-path in G} if 11 / 0 . 

Lemma 1 (Graham [10]). //z any list schedule for a set of precedence constrained jobs, 
there is an s-t-path of jobs that is executed during all periods when some machine is 
idle, and the length of this path is not longer than the makespan of an optimal schedule. 

The big advantage of Graham’s list scheduling is that it is easy, both to understand 
and to implement, and that it is applicable for a wide range of problems. 

3 The Makespan on Identical Parallel Machines 

Minimizing the makespan of a set of precedence constrained jobs on one machine is 
trivial. If the jobs are not restricted at all, we can simply schedule them in any order 
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Algorithm 1: Earliest Start List Scheduling. 

Input: AND/OR-network N = (V U W, E), number m > 1 of machines, processing 
time vector p > 0 

Output: schedule S = (Si, . . . , S n ) 

compute earliest start schedule ES for (N,p) (no machine constraints); 
let G = ( V , E(G)) be the subgraph of N induced by V ; 
foreach OR-node w = (X,j) € I V’ do 

for exactly one x £ X with ES X + p x = ES W do 
j_ add edge ( x,j ) to E(G)\ 

let L be a linear extension of the resulting AND-graph; 

apply list scheduling to the AND-graph G with p and L to obtain the schedule S'; 

return S; 



without interruption and idle time in between. If standard precedence constraints are 
involved, we can process the jobs in order of a linear extension of the partial order 
G representing the precedences. In the case of AND/OR precedence constraints, the 
solution is equally simple, the jobs can be executed in the order of a linear realization of 
the AND/OR-network. In every case, the makespan is equal to the sum of the processing 
times of the jobs, assuming that the constraints are feasible of course. 

Now consider the case of scheduling a set of precedence constrained jobs on m 
identical parallel machines. The problem Pm\ C' max is already NP-complete for m — 
2, but can be solved in pseudo-polynomial time for any fixed number m of machines. 
For an arbitrary number of machines, i.e. P\ |C max , the problem is NP-complete in the 
strong sense [5]. 

Nevertheless, Graham’s list scheduling performs provably well for this problem. 
Let S' 01 " 8 denote a list schedule for P\ ICmax on m machines. Graham [10] proved 
that, for any instance, C'max (-S' 01 " 8 ) < (2 — C max (S'*). He also showed that this 
performance guarantee is not affected by precedence constraints. Thus list scheduling 
is a 2-approximation for P|prec|C max . 

As a generalization of standard precedence constraints, the problem of minimiz- 
ing the makespan of AND/OR constrained jobs on identical parallel machines, P|ao- 
prec|C max , is NP-complete in the strong sense. Fortunately, the problem with AND/OR 
precedence constraints can be reduced to the problem with standard precedence con- 
straints preserving the approximation guarantee. Gillies and Liu [7] present a 2-appro- 
ximation algorithm for acyclic AND/OR-networks. The basic idea of the so-called min- 
imum path heuristic in [7] is to first change the AND/OR-network into an AND-only 
network, i.e. a standard precedence digraph, and then apply Graham’s list scheduling. 
For this transformation, Gillies and Liu use a recursive argument which minimises the 
longest path to each OR-node provided that only AND-nodes precede the OR-node. 
This construction fails in the presence of cycles. We present an algorithm similar to the 
minimum path heuristic which is applicable to AND/OR-networks containing cycles. 

The strategy of our earliest start list scheduling (ESL) presented in Algorithm 1 is 
to fix the predecessor of an OR-node to one of the tight (with respect to the earliest start 
schedule) direct predecessors. Then the OR-node can be removed by making the cho- 
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sen predecessor a direct predecessor of the immediate successor of the OR-node. Note 
that the resulting precedence graph G is a realization of N. In addition, it minimises 
the longest path to each OR-node, respectively its waiting job, among all realizations. 
Every step of the earliest start list scheduling can be executed in polynomial time, thus 
the earliest start list scheduling can be implemented to run in polynomial time. To- 
gether with Lemma 1 by Graham and the optimality of the earliest start schedule with 
respect to the makespan objective without machine constraints, we are able to prove an 
approximation guarantee of 2. 

Theorem 1. Let S'® 81 " denote an earliest start list schedule and S* an optimal schedule 
for an instance of P\ao-prec\C max . Then 

G max (S ESL ) < (2 — — ) G max (S*). 
m 



Moreover, this bound is tight. 

Proof Consider an earliest start list schedule ,S E8L and its makespan C max (S lESL ). At 
any time 0 < t < G max (S lESL ) either all machines are busy or one or more machines 
are idle. Accordingly, we can divide the time between 0 and C l max (S lESL ) into busy 
periods and idle periods. We denote the total length of all busy periods by T b and the 
total length of all idle periods by Ti, thus G max (»S' ESL ) = T b + Tj. 

For the total length of idle periods, Tj < £™ ax (G) by Lemma 1. From the con- 
struction of graph G we get that £™ ax (G) = C max {ES), where ES is the earliest start 
schedule of (N, p) without machine constraints computed in the first step of the algo- 
rithm. The earliest start schedule achieves the optimal makespan for an instance ( N , p) 
and therefore is a lower bound on G max (5*). Together this yields Ti < C max (S*). 

Additionally, we can compare the total processing time of all jobs with the makespan 
of S* and <S' ESL . A trivial lower bound on the makespan of any feasible schedule — and 
thus also an optimal schedule — is Pj- The earliest start list scheduling has to 

process all the jobs. During a busy period, all m machines process some job, while 
during an idle period at least one machine processes some job. We therefore get that 
mT b + 1 Ti < J2 ieV Pj < mG max (S*). The worst case for G max (S' ESL ) = T b + T t 
subject to the presented constraints is achieved for T] = G rnax (»S'*) and T b = ( 1 — 
l/m)G max (S 1 *), which yields the result. 

Examples for standard precedence graphs that achieve the worst-case bound of Gra- 
ham’s list scheduling can be found in [10] or [6]. If we apply the earliest start list 
scheduling to such instances, there is nothing to be done in the first part of the algorithm 
and the performance guarantee will be achieved by the last part, the list scheduling. This 
proves the tightness of the stated approximation ratio and completes the proof. □ 

We observe that the presence of additional OR constraints does not seem to make 
it any harder to minimize the makespan of precedence constrained jobs on identical 
parallel machines. Unfortunately the situation changes completely for the total weighted 
completion time objective. 
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4 The Total Weighted Completion Time on One Machine 

The problem of minimizing the total weighted completion time on one machine is NP- 
hard in the strong sense as soon as precedence constraints are involved [15]. Again this 
complexity result carries over to AND/OR precedence constraints. 

In contrast to the makespan objective, minimizing the total weighted completion 
time of a set of AND/OR constrained jobs is much harder than for standard prece- 
dence constraints. In fact, it is not approximable within any reasonable factor. There- 
fore we can expect that the successful approaches for standard precedence constraints 
will not be applicable to AND/OR precedence constraints. We will examine the per- 
formance of list scheduling with shortest processing time (SPT) rule on the problem 
l\ao-prec\J2 UjCj for both cases of unit and arbitrary weights. The SPT rule sim- 
ply orders the jobs according to non-decreasing processing times. It turns out that the 
greedy approach of list scheduling with SPT rule results in a useful property of the 
computed schedule with respect to any other feasible schedule. 

Let (N = (V U W,E),p,ui) be an instance of l|ao-prec| ^WjCj and let S' 8131, 
with completion time vector C' SP 1 be the schedule computed for this instance by list 
scheduling with SPT rule. Consider an arbitrary feasible schedule S and its completion 
time vector C for (N,p,ui). For any job j £ V we define a threshold &(S) with 
respect to schedule S : &(S) = ma x{p; | Cj < Cj } , i.e. the maximum processing 
time of a job equal to j or scheduled before j in S. The vector of thresholds, £(£) = 
(/j 1 (S ), . . . ,£”(£)), is called the threshold of schedule S. 

Lemma 2. Let S be a feasible schedule for an instance ( N , p, w) with threshold £(S). 
Then for every job j £ V it holds that Pi < ffS) for all i with C 8PT < Cf PT . 

Proof The proof is accomplished by induction along the order of the jobs in the feasible 
schedule S. To this end we assume without loss of generality that the jobs are numbered 
such that S\ < S 2 < ■ ■ ■ < S n . In addition, we write £ for short instead of £,(S). 

First we make the following observation for schedule ,S’ sl ’ T , which we will refer 
to as the greedy choice argument. Let t be the point in time, at which job j becomes 
available, i.e. in some realization of N all jobs in the predecessor set of j have been 
completed before t. Then, by the greedy choice of list scheduling with SPT rule, no job 
with a longer processing time than j itself (and therefore coming after j in the list) will 
be scheduled between t and Sj . 

Consider job 1. By definition, = p-\ . In addition, job 1 is available at time t = 0 
as it is scheduled at that time in the feasible schedule S and thus cannot have any direct 
predecessors except s. The claim follows by the greedy choice argument. 

Now consider job k and assume that for all j = 1, . . . , k — 1 it holds that Pi < 
for all i with Cf PT < Cf PT - We have to distinguish between the two cases that the 
threshold of k is greater or equal to the threshold of k - 1 . 

Casea)£ fe > £ fc_1 . From the definition of £ it follows that £f = pu and thus pk > £ fc_1 . 
Let x £ {1, . . . , k — 1} be the job that finishes last in the list schedule, i.e. C 8PT = 
max{C' 8P1 | j = 1, . . . , k — 1}. By assumption. Pi < £f < £ fc_1 < p k for all i with 
Cf PT < We conclude that k is scheduled after x in the list schedule and j/j < £ fc 

for all i with Cf PT < C| PT . From the feasibility of schedule S and the maximal choice 
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of x in S' SPT it follows that k is available at time Cf PT . By the greedy choice argument, 
no job with a strictly longer processing time than k will be scheduled between Cf PT 
and Cf PT , which proves the first case. 

Case b) = £ fc_1 . In this case, k may finish before or after some other job j < k. 
Again let x £ {1, . . . , fc— 1} be such that Cf PT = maxIC® 13,1 j = 1, . . . , k — 1}. If k 
completes before x, i.e. Cf PT < Cf PT then the claim trivially holds as by assumption 
Pi < £ x < £ fc_1 = for all i with Cf PT < C| PT . If on the other hand k completes 
after x, we can again argue as in case a). For the jobs completed before Cf PT the claim 
holds by assumption and for the jobs completed between Cf PT and C® PT it follows by 
the greedy choice argument. □ 

Note that this property of S^ 1 holds for the threshold of any feasible schedule S 
and thus in particular for the threshold of an optimal solution S * , independent of the 
optimality criterion. 

Theorem 2. Scheduling a set of n AND/OR precedence constrained jobs in order of 
non-decreasing processing times (SPT) is an O(yfn) -approximation for the problem 
l|ao-prec| Cj. Moreover, this bound is tight. 

Proof Consider the schedule S' SPT with completion time vector C SP 1 for an instance 
(N,p) of l\ao-prec\ Cj with n jobs. Let x £ V be the last job in the list schedule for 
which x itself and every job completed before x in the list schedule have a processing 
time smaller than or equal to C^ PT /y/n, if such a job exists. Formally, x is chosen such 
that C® PT = maxjgy {C® PT | p t < C® PT / y/n Vi with Cf PT < C® p 1 }. If such an x 
exists, let V- = {j £ V | C'f PT < C'| PT } denote the set of jobs that are scheduled 
before x including x itself. The set of jobs scheduled after x in the list schedule shall 
be denoted by V > = {j £ V | C® PT > C| pri }■ If no such x exists, then V- = 0 and 
V > = V. We have to treat the jobs in the two sets separately. 

First consider V-. The total completion time of the list schedule for the jobs in 
V- can be generously bounded from above by '}Z /e y< C® PT < nC® PT . Now we 
have to bound the total completion time of an optimal schedule for the jobs in V- 
from below. By construction, p t < C^ PT / y/n for every j £ V-. Thus, there are at 
least r > sffi jobs in V- and we want to minimize their total completion time. Let 
ji,... ,j r t> e the jobs in V- such that Cf < ■ ■ ■ < C* . Since Y^i=iPj — 
we obtain the following inequality: Jf jGV < C* > C® PT + (Cl 13,1 — pj ) + (C® PT — 
Pj — Pj Ci) + ‘ ‘ ‘ + Pi- The sum on the right hand side is minimized if we descend 
from the maximal summand C| PT as fast as possible and in as few steps as possible. 
Since pj < C® PT / y/n for i = 1, . . . , r, this is achieved if the processing time of every 
job ji, i = 1, . . . , r, takes its maximum possible value pj = C^ PT fy/n. We then get 
Jfj eV < Cj > *Cf PT / > (v / A/ 2)C® pt . Combining upper and lower bound 

for the jobs in V- yields 

E C f T < 2v^ E C ! ■ (D 

je v< jev< 

Now consider the jobs in V > . By definition, for every j £ V > there exists a job k 
with pk > Cj PT /y/n that is either equal to j or scheduled before j in the list schedule. 
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Consider an optimal schedule S* and its corresponding threshold &(S*) for j. By 
Lemma 2 we know that j and every job scheduled before j in the list schedule has a 
processing time smaller than or equal to (S*). This yields 




< Pk < < C* 



( 2 ) 



as (S'* j is a trivial lower bound on C* by definition. 

Combining (1) and (2) proves the approximation guarantee: 

E c f T <E c 'f T + E c f T ^E c ;+ 

jev jev< jev> jev< jev> jev 

We have shown that list scheduling with SPT rule is a ^-approximation algorithm 
with p £ 0(y / n), it remains to prove that also p £ Already for standard 

precedence graphs, the approximation ratio for list scheduling with SPT rule is bounded 
from below by fi(y/n) and therefore this also holds for the general case of AND/OR 
precedence constraints. We will demonstrate this by presenting a series of digraphs that 
force the solution obtained by list scheduling with SPT rule to be a factor of 0{^/n) 
away from the optimum. 

Let k > 0 be an integer and consider the following precedence graph G = ( V. E). 
The set of jobs V consists of k jobs i\ , . . . , ik, one job x, and k 2 jobs j i , . . . , jp 2 that 
form a chain, preceded by x. Thus in E there are the edges (x,ji) and (j K ,j K + 1 ) for 
all 1 < k < k 2 — 1. We have the following job processing times: pi = k 2 , for 
k = 1 , . . . , k, p x = k 2 , and pj = 1 , for all k = 1 , . . . , k 2 . 

The SPT rule orders the jobs according to non-decreasing processing time and thus 
a possible ordering is L = ji , ... , j fc 2 , ik, x. Note that we can force the SPT rule 
to construct this bad list L by slightly modifying the processing times. List scheduling 
then computes a schedule with the following order of the jobs: i\ . . , i ^ , x, j\ , . . . , ■ 

For the objective value of this schedule we get that 

fc+i 

E c f PT ^ E a ’ 2 + fc2 ( fc + 1 ) fc2 = 

jev <=£ 

The optimal schedule prefers job x, which can release the long chain of short jobs. 
Therefore, an exact algorithm produces the schedule: x,j i , . . . , jk 2 , ii , . ■ ■ , ik- For the 
value of this optimal schedule we can calculate that 

fc +2 

E Cj < (k 2 + 1)2 k 2 + E ffc2 = °( fc4 ) ■ (4) 

jev r=3 

Thus the performance ratio of list scheduling for this instance is 17 (k 5 ) / 0(k 4 ) = fi(k). 
The precedence graph has n £ 0(k 2 ) many vertices, which yields an approximation 
guarantee of 0(y/n) for list scheduling with SPT rule on this instance for n respectively 
k going towards infinity. □ 
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Coming back to the weighted problem l|ao-prec| J} Cj , we are actually able to 
establish a strong inapproximability result. Goldwasser and Motwani [8] present an ap- 
proximation preserving reduction from Label Cover to a special case of scheduling 
with AND/OR precedence constraints arising in disassembly problems. 

An instance of a problem considered in [8] is given by a set of jobs, called tasks, 
with unit processing times. The jobs are either AND- or OR-nodes and there are no 
weights involved. The AND/OR-network representing the precedence constraints is as- 
sumed to have internal-tree structure, which is a slight generalisation of a tree. We call a 
node of the network a leaf, if it has no direct predecessors. An AND/OR-network N has 
internal-tree structure, if N \ {j £ V \ j is a leaf} is an in-tree. The scheduling model 
requires that for an AND-node to be scheduled, all its direct predecessors have to be 
processed before the AND-node, and for an OR-node only one direct predecessor has 
to be scheduled before the OR-node and the others may be left completely unprocessed. 
The goal of AND/OR scheduling is to minimize a) the number of leaves or b) the num- 
ber of jobs (AND- and OR-nodes) that need to be scheduled to be able to schedule some 
sink y of the network, and thus solving the problem instance. Goldwasser and Motwani 
show that Label Cover is a special case of scheduling AND/OR constrained jobs 
subject to objective a) and that a) can be reduced to b), preserving the approximation 
guarantee. 

Minimizing the number of jobs, i.e. objective b), that need to be scheduled to be 
able to schedule some sink y of an AND/OR-network can be modeled as a special case 
of l|ao-prec| In the given AND/OR scheduling instance we assign unit pro- 

cessing times to all AND-nodes and zero processing time to the OR-nodes. In addition 
we set ojy = 1 and u> v = 0 for every other node v £ V U W . A solution S' to the 
problem of scheduling a minimum number of AND/OR constrained jobs corresponds 
to a solution to 1| ao-prec,pj = 1| J} t OjCj that minimises the total weighted comple- 
tion time oJjCj = C y : Simply schedule all jobs contained in S' before y and 

all other jobs afterwards. The total weighted completion time is equal to the number of 
AND-nodes of solution S'. 

Together with a strengthened inapproximability result for Label Cover by Dinur 
and Safra [4] we can make the following statement. 

Theorem 3. The scheduling problem l\ao-prec\Y2 UJ jCj with unit processing times 
is NP-hard to approximate within a factor of 2 log n of the optimum value, where 
7=1 / (log log n) c for any constant c <1/2 and n = \V\ is the number of jobs. 

For a detailed presentation of the proof we refer to [13]. Theorem 3 shows that 
l\ao-prec\ is a very hard problem even if all processing times are equal to 

one. Interestingly enough, the next theorem proves that list scheduling with shortest 
processing time rule is an easy /(-approximation for this problem, and it seems difficult 
to achieve an asymptotically better ratio. 

Theorem 4. Scheduling a set of n weighted AND/OR precedence constrained jobs in 
order of non-decreasing processing times (SPT) is an n-approximation for the problem 
l|ao-prec| ^ a JjCj. Moreover, this bound is tight. 

Proof. Let (N = {V U W,E),p,to) be an instance of l|ao-prec| uJj Cj . Without 

loss of generality assume that the jobs are numbered such that pi < P 2 < • • • < p n . 
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Let S' SPT and C SPT denote the schedule and its completion time vector produced by 
Graham’s list scheduling with the list L = 1, . . . , n. The optimal schedule and its 
completion time vector are denoted by S* and C* as usual. By definition, the threshold 
of job j with respect to S* is a lower bound on C* . Now we have to bound 
Cf 1 from above. Lemma 2 states that for each j £ V it holds that p.-, < £•? (S * ) for all 
jobs i with Cf 15 " 1 " < C® 1 PT . For every j £ V we therefore get that 

Cf T < £ Pi < < <( S *) < nC* , 

p <« (s*) P <z (S*) 

which yields the upper bound on the approximation ratio on a per job basis. 

To prove the lower bound we present a series of AND/OR-networks that force the 
solution of list scheduling to be a factor of n away from the optimum. 

Let k > 0 be an integer and consider the following AND/OR-network TV = (V U 
W, E). The set of jobs V consists of k + 2 jobs, i\, . . . ,ik, x, and;/. In addition there is 
one OR-node w £ W. The edge set E contains the edges (i K . i K +\) for k = 1, . . . , k — 
1, ( ik,w ), (x,w), and ( w,j ). The processing times of all jobs are equal to one. The 
weights are u>i =0 for all k = 1, . . . , k, uj x = 0, and loj = 1. 

The constructed AND/OR-network is very simple, it consists of one OR-node w 
with its direct successor j. The two predecessors of the OR-node w are one chain of k 
nodes and one single node x. If we sort the jobs in order of non-decreasing processing 
times, we may get the bad list L = j, i\, . . . , ik, x. Again, by slightly modifying the 
processing times we can force the shortest processing time rule to choose this bad list 
L. List scheduling then computes the schedule i i, . . . , ik,j, x with an objective function 
value of Yujev u i C f PT = k + 1. 

The optimal schedule prefers job x, as it can release j, which is the only job with 
positive weight. Therefore, an exact algorithm produces the schedule x,j,i\, . . . ,ik 
with a value of Y^jev = 2- 

The performance ratio of list scheduling for this instance is = &{k). The 
AND/OR-network has n = k + 2 £ O(k) many jobs, which yields a performance ratio 
of 0(n) for the list scheduling algorithm on this instance for n respectively k going 
towards infinity. □ 

The constructed example has very restricted values, thus the performance guarantee 
even holds for the problem 1 1 ao -prec,pj = 1 ,u>j £ {0, 1}| In addition, it also 

provides a lower bound of f2(ri) for list scheduling with Smith’s rule. 

We remark that the ideas from the proof of Theorem 4 can easily be generalized 
to show that list scheduling with SPT rule is also an n-approximation algorithm for 
P | ao-prec \ Y Cj- . 

5 Conclusions 

We want to briefly summarize the stated results. In Sect. 3 we have seen that minimiz- 
ing the makespan on identical parallel machines subject to AND/OR precedence con- 
straints can be approximated within a factor of 2. This approximation guarantee was 
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established by transforming the AND/OR precedence constraints into standard prece- 
dence constraints and applying list scheduling to the resulting instance. It is remarkable 
that in the case of the makespan objective, additional OR constraints do not seem to 
make the scheduling problem any harder. 

In contrast to this, a major gap occurs in the approximability of minimizing the total 
weighted completion time for standard and AND/OR precedence constrained jobs on 
a single machine. While the problem can be approximated within a constant factor for 
standard constraints, the inapproximability result stated in Theorem 3 shows that this 
is not the case for AND/OR constraints unless P=NP. Therefore, it is not possible to 
transform the AND/OR constraints into standard precedence constraints such that the 
approximation ratio for the problem is preserved. In addition, there seems to be a dif- 
ference in the approximability of the weighted and the unweighted case for AND/OR 
precedence constraints. For standard precedence constrained jobs, Woeginger proves 
in [21] that the best possible approximation ratio for minimizing the total weighted 
completion time is the same for the general problem as well as for a number of spe- 
cial cases including unit processing times, unit weights, and combinations thereof. For 
AND/OR precedence constrained jobs, the inapproximability result of Theorem 3 and 
the O ( \/v ) - ap p ro x i m at i o n algorithm for the unweighted case of Theorem 2 suggest that 
a gap may exist between the hardness of approximating the total weighted and the total 
unweighted completion time. We also could not prove any inapproximability result for 
l|ao-prec| Y Cj. It is still possible that there exists a constant or at least logarithmic 
approximation algorithm for the problem l|ao-prec| Y Cj. 
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Abstract. We present two new combinatorial approximation frame- 
works that are not based on LP-duality, or even on linear programming. 
Instead, they are based on weight manipulation in the spirit of the local 
ratio technique. We show that the first framework is equivalent to the LP 
based method of dual fitting and that the second framework is equivalent 
an LP-based method which we define and call primal fitting. 

Our equivalence results are not unlike the recently proven equivalence 
between the (combinatorial) local ratio technique and the (LP based) 
primal-dual schema. Although equivalent, the local ratio approach is 
more intuitive, and indeed, several breakthrough results were first ob- 
tained within the local ratio framework and only later translated into 
primal-dual algorithms. We believe that our frameworks may have a 
similar role with respect to their LP-based counterparts (dual fitting 
and primal fitting). Also, we think that the notion of primal fitting is of 
independent interest. 

We demonstrate the frameworks by presenting alternative analyses to the 
greedy algorithm for the set cover problem, and to known algorithms for 
the metric uncapacitated facility location problem. Also, we analyze a 
9-approximation algorithm for a disk cover problem that arises in the 
design phase of cellular telephone networks. 



1 Introduction 

This paper is motivated by two prominent papers dealing with the metric un- 
capacitated facility location (MUFL) problem. Both papers provide algorithms 
based on linear programming (LP) duality that deviate from the standard pri- 
mal dual approach. In the first, Jain and Vazirani [16] present a 3-approximation 
algorithm for MUFL which deviates from the standard primal-dual paradigm in 
that it does not employ the usual mechanism of relaxing the dual complementary 
slackness conditions, but rather it relaxes the primal conditions. In the second 
paper Jain et al. [15] develop two dual fitting algorithms for MUFL, with ap- 
proximation factors of 1.861 and 1.61. Dual fitting algorithms produce a feasible 
primal solution and an infeasible dual solution such that: (1) the cost of the dual 
solution dominates the cost of the primal solution, and (2) dividing the dual so- 
lution by an appropriately chosen r results in a feasible dual solution. These two 
properties imply that the primal solution is r-approximate. This contrasts with 
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the standard primal-dual approach, in which a feasible dual solution is found and 
used to direct the construction of a primal solution. The dual fitting method was 
(implicitly) used previously to analyze the greedy 7J n -approximation algorithm 
for set cover [17, 11]. (H n is the nth harmonic number, i.e., H n = £" =1 t) 
MUFL and other facility location problems have received a lot of attention 
from the approximation community recently. A survey by Shmoys [19] describes 
some of the techniques used to approximate MUFL and the k-median problem. 
The best approximation factor for MUFL to date is 1.52 [18]. A lower bound 
(assuming NP $7 DTIME(?i°( loslogn ))) of 1.463 is due to Guha and Khuller [13]. 

In this paper we present two combinatorial approximation frameworks that 
are not based on LP-duality, or even on linear programming. Rather, they are 
based on weight manipulation in the spirit of the local ratio technique [5,4]. Both 
frameworks use the following idea. Two weight functions w\ and W 2 are used, 
where W 2 = r ■ w\, and the weight with respect to w i of the solution found is 
shown to be less than or equal to the optimum with respect to w 2 - 

In the remainder of this section we briefly describe the primal-dual schema 
and the dual fitting approach. In Section 2 we present our first framework and 
show that it is equivalent to dual fitting. We demonstrate this framework by 
providing alternative analyses of the greedy algorithm for set cover and the Jain 
et al. [15] 1.861-approximation algorithm for MUFL. In Section 3 we present 
our second framework and define a method called primal fitting , which we show 
is tantamount to it. We demonstrate our second framework and the notion of 
primal fitting by analyzing two algorithms that where originally designed within 
the primal-dual paradigm: the Jain-Vazirani [16] 3-approximation algorithm for 
MUFL, and a 9-approximation algorithm of Chuzlroy [10] for a disk cover prob- 
lem arising in the design phase of cellular telephone networks. 

Our equivalence results are not unlike the recently proven [6] equivalence be- 
tween the (combinatorial) local ratio technique and the (LP based) primal-dual 
schema. Though equivalent, the local ratio approach is more intuitive — indeed, 
several breakthrough results such as the first constant factor approximation al- 
gorithm for feedback vertex set [1, 7, 9] and the first local ratio/primal-dual algo- 
rithms for maximization problems [2, 3] were first obtained within the local ratio 
framework and only later interpreted as primal-dual algorithms. We believe that 
our frameworks may have a similar role with respect to their LP-based counter- 
parts. Also, we think that the notion of primal fitting is of independent interest. 

1.1 Primal-Dual Algorithms 

Consider the following linear program and its dual: 

min £"=i wjXj max £” =1 biyi 

s.t. £" =1 dijXj > bi Mi £ {1, ... , to} s.t. £" =1 atjUj < Wj Mj £ (1, . . . , n} 

Xj >0 Vj £ {1, . . . , n} Vi > 0 Mi £ (1, ... , m} 

A primal-dual r-approximation algorithm is an algorithm that constructs an 
integral primal solution x and (possibly implicitly) a dual solution y, such that 
x and y satisfy 
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wx < r ■ by. (1) 

It follows, by weak duality, that x is r-approximate. One way to find such a pair 
of primal and dual solutions is to focus on pairs ( x , y) satisfying the following 
relaxed complementary slackness conditions (for appropriately chosen rq and r 2 ): 

Relaxed Primal: Vj, Xj > 0 => Wj/ri < Y^iLi a ijVi — w j ■ 

Relaxed Dual: Vi, yi > 0 => bi < ]P ’ l = i a ij x j — r 2 • bi- 

These conditions imply that x is (rq ■ /^-approximate since 

n n / m \ min \ m 

W 3 X 3 < H 7 '1 ' ( /L a ijVi ) X 3 = Tl ‘ X! ai 3 X 3 I < C ' >2 ' X! 

j=l j=l Vi— 1 / *=1 \j— 1 / i— 1 

In most primal-dual algorithms only the dual complementary slackness con- 
ditions are relaxed, i.e., n = 1. (See, e.g., [12,8].) Typically, such an algorithm 
begins with the infeasible primal solution x = 0 and the feasible dual solution 
y = 0. It then iteratively increases dual variables and “improves” the feasibility 
of the primal solution, ending with a primal/dual pair satisfying the relaxed dual 
conditions. (Sometimes a “cleanup” phase is also needed). This design method 
is commonly referred to as the primal-dual schema. 

1.2 Dual Fitting 

The dual fitting method is a variation on the primal-dual theme. The idea is to 
construct a primal/dual solution pair, but this time the dual solution is infeasible 
and its cost dominates the cost of the primal solution. The key point is that the 
dual solution has the property that dividing it by r yields a feasible solution. 
Thus, this shrunk dual and the primal solution satisfy Inequality (1), and hence 
the primal solution is r-approximate. The analysis of a dual fitting algorithm 
consists therefore of two parts: (1) showing that the dual solution dominates the 
primal solution (cost- wise), and (2) showing that dividing the dual solution by 
r makes it feasible, or rather, finding the smallest r for which this is guaranteed. 

The dual fitting method was introduced as a design technique by Jain et 
al. [15]. They recounted Clrvatal’s work on set cover [11] and showed how the 
factor H n can be obtained as the solution to the optimization problem mentioned 
at the end of the previous paragraph. Following that they described two dual 
fitting algorithms for MUFL achieving approximation factors of 1.861 and 1.61. 

2 A Combinatorial Framework Equivalent to Dual Fitting 

In this section we show that the dual fitting approach can be described in purely 
combinatorial terms without recourse to linear programming. We demonstrate 
this on Chvatal’s set cover algorithm in full detail, and discuss the principles of 
our approach at some length. We then sketch briefly how the same can be done 
for the Jain et al. algorithms for MUFL [15]. 
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2.1 Set Cover 

In the set cover problem we are given a collection of non-empty sets C = 
{Si, . . . , S m } and a non-negative weight function w on the sets. Set S, is said 
to cover element u\iu€ Si. The goal is to find a minimum- weight collection of 
sets that together cover all elements in the universe U = UHi Si- 

Recall that the greedy algorithm for set cover selects in each iteration the set 
Si minimizing the ratio between its cost and the number of elements covered by 
it that have not been covered in previous iterations. Algorithm SC (appearing 
below) is our reformulation of the greedy algorithm. To explain it, we find it 
convenient to imagine sets being paid by elements in order to cover them. The 
method of payment is that each element issues checks for various amounts to 
the sets containing it, but these checks are not immediately cashed. Rather, a 
set cashes its checks (and adds itself to the solution) only when the total sum of 
these checks is high enough to cover the set’s cost. Also, elements occasionally 
retract some of the (as yet uncashed) checks they have issued. 

More specifically, the algorithm proceeds as follows. At the beginning of each 
iteration a value e > 0 is determined, and every uncovered element issues a check 
for the sum of e to every set containing it. If as a result a set Si becomes fully 
funded by the checks in its possession, it cashes these checks and adds itself to the 
solution. (If several sets become fully funded, one is selected arbitrarily.) Every 
newly covered element then retracts all of the checks it has given to other sets. 
This process continues until all elements get covered. Intuitively, the motivation 
for retracting checks is that each element is willing to pay to get covered, and 
when several elements get covered by a given set, they expect to split the set’s 
cost among them evenly. However, a given element does not know at the outset 
which set will eventually cover it, so it distributes (equal sum) checks to all 
of the sets containing it. When the element finally gets covered it retracts the 
uncashed checks it has issued, since it has no desire to help cover other elements. 
The outcome is that every element eventually contributes towards the cost of 
exactly one set — the first set containing it that is added to the solution — and 
these contributions add up to the total cost of the solution. 

We now relate the above description of the algorithm to the pseudo-code 
below. The algorithm keeps track of the elements already covered by deleting 
them from all sets in which they are members and discarding sets that become 
empty. The algorithm keeps track of the checks issued to a set by adjusting the 
set’s weight: when a check is received by a set, the set’s weight is decreased by 
the amount written on it, and when the check is retracted, the set’s weight is 
increased by the same amount. The algorithm keeps track of the total sum of 
uncashed checks issued by an uncovered element to a set containing it by means 
of a variable A. Since at every moment this amount is simply the sum of es 
used up to that moment, a single variable suffices for all elements. Finally, in 
anticipation of the analysis, the algorithm conceptually creates and manipulates 
(in Lines 2 and 7) a new weight function w® which is initially set to r-w, for some 
r to be determined later. We stress that this is not really part of the algorithm, 
and is only included for purposes of the analysis. 
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Algorithm SC 

1. Z\ 0. 

2. Conceptually: ic® <— r ■ w. 

3. While the problem instance is not empty do: 

4. e<-min i {w(S'j)/|S' i |}. 

5. Define a weight function <5 by: Vi, <5(S)) = e • 1 5”^ | . 

6. w i — w — S. (Note that by the choice of e, the 

weight of at least one set drops to 0.) 

7. Conceptually: w $ <— w* — S. 

8. A i — A -f- e. 

9. Select a set Si such that w(Si) = 0. 

10. For all i ^ l : w(Si) w(Si) + A • | Si fl Si\. 

11. Add Si to the solution and remove the elements in Si from 
the problem instance. Discard sets that become empty. 



In each iteration every set receives a budget proportional to its current size, 
and the set minimizing the ratio between its weight and size is selected. The 
retraction of checks ensures that at all times the ratio weight / size of a given set 
is the ratio that would be calculated in the original greedy algorithm minus A, 
aud therefore the set minimizing this ratio is the same in both algorithms. Thus, 
Algorithm SC is indeed a restatement of the greedy algorithm. 

As mentioned earlier, the analysis uses a new weight function w$. Denoting 
by Opt and Opt* the optimum value with respect to w and tc®, we have Opt* = 
r • Opt. We first show that if ic* remains non-negative throughout the execution 
of the algorithm, the solution is r-approximate. Then we show that r = H n , 
where n = \U\, ensures this. This implies an approximation guarantee of H n . 

Let a notation with subscript j denote the appropriate object in the jth 
iteration. Also, let \S, \,- be the size of S t in the jth iteration. Let t be the number 
of iterations and let j{u ) denote the iteration in which element u is covered. When 
element u gets covered by a set S, it retracts the checks it has issued to other 
sets and gets deleted from the problem instance. Thus the amount u ends up 
actually paying is the total sum of checks it has passed to S, i.e. , e j- Since 

the solution returned by the algorithm is fully funded by these payments its 
cost is Yhueu 1jj=i e j = l Wj\- tj- Now consider Opt*. In the jth iteration 
the cost with respect to w $ of every set Si decreases by e,j ■ Consequently, 
since every solution must cover all elements, Opt® decreases by at least ej ■ \ U 7 \ . 
The deletion of elements further decreases Opt* since u>* is always nonnegative. 
Thus the total decrease in Opt* is at least e j ' I Uj\, and this decrease brings 
Opt down from its initial value r • Opt to zero (the optimum value for the 
empty instance). Thus Jjj=i Wj\ ’ e j — r ' Opt. 

We have thus shown that if u>* remains nonnegative, the solution is r- 
approximate. It remains to find the smallest r ensuring this. Finding r is an 
optimization problem, which (following Jain et al. [15]) we tackle in the follow- 
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ing manner. For a given set S, the total decrease in w $ (S) equals the total sum 
of the checks given to S (including ones that were eventually retracted). Denote 
the elements of S by ui , . . . ,Ud, indexed in the order by which they were cov- 
ered, and let x, be the total sum of the checks given to S by u, . A sufficient 
condition on r is r > X^=i x > f° r possible choices of d, w(S ), and the XiS 
consistent with the algorithm, so we need to examine a worst case scenario in 
which the ratio is maximized. What constraints can we impose on the XjS? One 
set of constraints (which turns out to be all we need) is the following. Consider 
the moment u,: gets covered (just before any check retractions occur). Till that 
moment S has received an as yet unretracted sum of Xi from each of the d — i + 1 
elements u*, Uj+i, . . . , u<j. Since no set is ever in possession of checks exceeding its 
cost, we get Xi < w(S) /{d — * + 1). Thus for fixed d and w(S), our optimization 
problem is ma x{ X^ =1 x % '■ Vi, x i < d-i+i } ■ Clearly the optimum value is 

JT_i d-l+i = H d — Hn, which is our approximation guarantee. 

Put abstractly, our combinatorial framework is the following. First establish 
(conceptually) a second weight function u;® = r • w. Then manipulate the weights 
of the objects in the problem instance such that a zero cost feasible solution S 
emerges. At the same time manipulate (conceptually) the u>® weights accordingly 
so that Opt® decreases by at least w(S) but never becomes negative. If r is 
chosen such that this is indeed possible, S will be r-approximate. 

Naturally, actual implementations of the framework have more structure built 
into them. In Algorithm SC we attributed the weight subtractions to the ele- 
ments in such a way that every element was made responsible for a well defined 
contribution to the change in w and w®. The decrease in w and w® in the ft li it- 
eration was done uniformly (from the perspective of the elements) , ensuring that 
the decrease in Opt could be charged to the surviving elements, at least e,; to 
each. The increases in w in the same iteration guaranteed that no element ended 
up contributing more than e.j (in that iteration) towards covering the cost of 
the solution. Thus the elements forge the link between the “budget” supplied by 
Opt and the cost of the solution. This approach is quite straightforward and 
works well for covering problems. It can be expected that more sophisticated 
charging schemes will be developed to cope with non-covering problems. 

2.2 Discussion 

Consider the following LP relaxation of set cover, and its dual: 

min YJiL\ w ( S i) x i max YhL i Vu 

s.t. Yji-.ues x i> 1 Vu e U s.t. J2 u es U™ ^ w ( s i) VI <i<m 
Xi > 0 VI < i < m y u > 0 Vu £ U 

Algorithm SC can be seen as a dual fitting algorithm by changing Line 2 
to “Conceptually: set all dual variables to 0” and Line 7 to “Conceptually: 
Vu £ U, y u <— y u +e.” By the same arguments as in our analysis of the algorithm, 
it is not difficult to see that the cost of the solution equals the value of the dual 
objective function (y u is the amount paid by u to the set that covers it first). 
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Similarly, the problem of finding the smallest r such that y/r is feasible can be 
formulated as the LP we have developed in the previous section [15]. 

The relation between the dual fitting interpretation of Jain et al. [15] and 
our combinatorial interpretation is no coincidence. Bar- Yehuda and Rawitz [6] 
showed (in the context of the primal-dual schema) that increasing a dual variable 
by e is equivalent to subtracting the weight function obtained by multiplying 
the coefficients of the corresponding primal constraint by e from the primal 
objective function. Here this is manifested as the equivalence between increasing 
the dual variables and subtracting S from uA Also, dividing y by r is equivalent 
(feasibility-wise) to multiplying the right hand sides of the dual constraints by 
r, which is equivalent to multiplying the primal objective function by r. 

Thus, the translation of a dual fitting algorithm to a purely combinatorial one 
is as follows. First define ic® = r-w. Then, every time a dual variable is increased 
by e, multiply the coefficients of the corresponding primal constraint by e and 
subtract them from uA The dual fitting analysis argues that the (infeasible) 
dual solution upper bounds the primal one. The same proof, properly rephrased, 
shows that the decrease in Opt upper bounds the solution’s cost. The second 
part of the dual fitting analysis is to find r such that dividing the dual solution 
by r makes it feasible. The proof of this can be used to show instead that w $ 
remains non-negative. 

We remark that although the problem of finding the best r is formulated as 
an LP (actually, as a family of LPs — one for each choice of d), this is merely 
a convenient way to express it. We have made no use of the theory of linear 
programming in its solution. Thus the term factor revealing LP applied by Jain 
et al. [15] to this LP is a bit inexact, as no real linear programming is involved. 



2.3 Metric Uncapacitated Facility Location 

In the uncapacitated facility location problem we are given a set F of facilities, 
and a set C of cities. Each facility i has an opening cost w(i) > 0, and each pair 
of facility i and city j is associated with a connection cost w(j,i) > 0. Our goal 
is to open some facilities and connect each city to one of them while minimizing 
the total cost. In the metric uncapacitated facility location problem (MUFL) the 
connection costs satisfy the triangle inequality. 

Jain et al. [15] restated MUFL as a special case of set cover in the following 
terms. A star S consists of a single facility i and a subset C' of the cities. The cost 
of S is u>s = w{i) + Ylj^c' w (p *)• The problem is to find a minimum weight set of 
stars containing all cities. Jain et al. [15] describe an algorithm for MUFL that is 
essentially Algorithm SC. They show that it returns 1.861-approximate solutions 
on MUFL instances. They obtain this low factor by adding constraints (peculiar 
to MUFL) to the problem of finding the best r. The additional constraints stem 
from the triangle inequality and from the fact that the contribution of each city 
participating in a star can be partitioned into payment for its connection to 
the facility and payment for opening the facility, and no city ever pays for the 
connection of another city. (See [15].) 
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Jain et al. [15] also describe a second dual fitting algorithm for MUFL achiev- 
ing a factor of 1.61. In this algorithm cities may “change their minds” about 
which facility to connect to, and when they do so they transfer funds from one 
star to another. This algorithm too can be analyzed in purely combinatorial 
terms in the manner outlined in this section. (Despite the transferring of funds 
between stars, the key point remains. In the ith iteration each city is responsible 
for a decrease of e* in Opt and is charged e, for the cost of the solution.) 

We remark that Jain et al. [15] do not use linear programming in the “factor 
revealing” stage of either algorithm. They do, however, recount solving the LP 
for small values of d in order to get an estimate of the general optimal solution. 

3 Primal Fitting 

In [16] Jain and Vazirani present a primal-dual 3-approximation algorithm for 
MUFL in which they relax the primal complementary slackness conditions rather 
than the dual conditions. In a sense, this is the mirror image of the standard 
primal-dual approach, but there is another way to view it. The conceptual dif- 
ference between primal-dual and dual fitting is that in standard primal-dual 
algorithms the “budget” supplied by the dual solution is inflated in order to 
cover the cost of the primal solution, whereas in dual fitting algorithms the un- 
slirunk dual covers the weight of the primal solution (but only by shrinking it 
does it become feasible). In Jain and Vazirani’s algorithm the dual solution cov- 
ers the weight of an infeasible primal solution which is then inflated to make it 
feasible. (This, of course, is our interpretation of their algorithm. It is not how 
they describe it.) This idea seems befitting of the name primal fitting. 

As with dual fitting, we show that primal fitting can be described in com- 
binatorial terms, avoiding linear programming completely. We demonstrate this 
on Jain and Vazirani’s algorithm and on a disk covering problem. 



3.1 Metric Uncapacitated Facility Location 

Algorithm MUFL below is our reformulation of Jain and Vazirani’s algorithm 
(with minor modifications) . Here too, it is convenient to imagine the cities paying 
for opening and connecting to facilities, but this time we prefer to think in 
terms of continuous, unit rate, flows of funds rather than discrete payments. 
The algorithm proceeds as follows. Initially, all cities are active. At a given 
moment every active city directs a stream of money towards each facility. The 
money flowing from city j towards facility i is first used to pay the connection 
cost w(j,i), and is said to be received by the connection. When the connection 
cost is covered, the city is said to have reached the facility, and subsequently 
the money flow from city j to facility i is used to cover the opening cost of 
facility i, and is said to be received by the facility. Thus a typical active city 
will be simultaneously funding connection costs to several facilities and opening 
costs of others, and a typical facility will be receiving funds simultaneously from 
zero or more cities (that have reached it). As time passes, two types of events 
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occur (multiple events may occur simultaneously). The first is when the total 
sum received by some facility i reaches its opening cost w(i). Facility i is then 
declared open 1 , and all cities that have reached it, either at that exact moment, or 
earlier, are made inactive. The second event type is when an active city reaches 
an open facility. The city is then made inactive. When the last city becomes 
inactive, the algorithm retains a certain subset of the open facilities (chosen in 
a manner described below) and closes all others. Each city now connects to the 
nearest open facility (i.e., it uses the cheapest connection to an open facility), 
and conceptually retracts the payments it has made to all other facilities and 
connections to them. To describe the subset of facilities that remain open we need 
the following terminology. A city contributes to a facility at a given moment if the 
facility has received from it a nonzero amount of money. Two facilities conflict 
whenever a city contributes to both. A set of facilities is independent if no two of 
them conflict. The algorithm keeps open an arbitrary maximal (under inclusion) 
independent subset of the open facilities. Such a set can be found by scanning 
the open facilities in arbitrary order and closing every facility conflicting with a 
previously considered facility that has not been closed. 

Algorithm MUFL below discretizes time at the events. It keeps track of 
payments by subtracting weights, and of inactive cities by deleting them. It 
keeps track of the cities contributing to each facility i by means of sets C). 



Algorithm MUFL 

1. Ci <— 0 for all i G F. 

2. While C ^ 0 do: 

3. Let e be maximal such that the weight function c> defined below 
satisfies w — 5 > 0. 

Vj eCVieF, 5(j, i) = min {e, w(j, i)}, 

Vi G F, S(i) = Ejec( £ ~ min i e > W (F *)})• 

4. w 4 — w — 5. 

5. For all i G F do: 

6. Ci t— Ci U {j G C : e — 5(j , i) > 0}. 

7. C «— C \ {j G C : 3i G F such that w(j, i) = w(i) = 0}. 

8. Let O = {i G F : w(i ) = 0}. 

9. Select a maximal set O' C O such that C), n Ci 2 = 0 for 
all ii, i 2 G O', ii 7^ i- 2 - 

10. Open the facilities in O', close all others, and connect each city j 
to a facility arg min {icq (j* i) : i G O'}, where wq is the original 
weight function. 



For the analysis, define a new weight function tu* = w/r. Let us conceptually 
perform the same weight subtractions on w® as on w, and let us perform the 
conceptual retractions in the final stage on w $ only. Let S be the solution re- 



1 Jain and Vazirani use the term temporarily open. 
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turned by the algorithm, let Opt be the optimum value, and let a notation with 
subscript / denote the appropriate object in the Zth iteration. In the Ztli iteration 
each surviving city sends two checks, totaling ei, in the direction of every facility 
i. One is received by the connection between j and i, and the other by facility 
i. The sums written on the checks (which may be 0) are deducted from the cor- 
responding weights. Consequently the cost of every feasible solution, and hence 
Opt, drops by at least e; • \Ci\, since every feasible solution opens facilities and 
connect each city to one of them. Additionally, deleting cities can only further 
decrease Opt. Now consider the effect of the Ith iteration on w $ (S). The ac- 
tual drop in w®(S) might be quite large, but if we factor in the fund retractions 
of the final stage, we can “adjust” the drop in the Ith iteration and charge it 
to the surviving cities, at most ei per city. Thus the adjusted drop is at most 
erlCil, and therefore the total drop in Opt upper bounds the total drop (minus 
retractions) in w $ (S). If r is such that the final value of w $ (S) is non-negative, 
then w $ (S) < Opt (where the inequality refers to the initial values of w $ (S) 
and Opt) since Opt never drops below zero, as w remains non-negative. This 
implies an approximation factor of r. 

We now argue that Setting r = 3 guarantees this. Upon termination, the 
tc-cost of the facilities open in S is zero. Since they are independent, they do 
not suffer check retractions, so their w®-cost is non-positive as well. The same 
argument cannot be applied to the connection costs, however, since cities do not 
necessarily connect to facilities they have reached. Nonetheless, for every city j 
there exists an open facility i in S such that the connection between j and i 
has received checks in the amount of at least | w(j,i ) = (See Lemma 5 

in [16] for a straightforward proof of this fact, based on the triangle inequality.) 
Since j issues checks uniformly in all directions, all connections leading out of j 
that are not fully funded have received the same amount. Thus, the connection 
chosen for j (which is of minimum cost) has certainly received at least one third 
of its cost, so the u>®-cost of this connection is non-positive upon termination. 

Two remarks. Our first remark regards the asymmetry in the treatment of fa- 
cilities and connections, namely, that connections might be funded to as little 
as a third of their actual cost, whereas facilities are always fully funded. This 
disparity is not fundamental — the analysis remains virtually unchanged if we 
declare a facility open when it has received one third of its opening cost. The 
reason Jain and Vazirani insist on fully funded facilities relates to their use of 
Algorithm MUFL in the context of the k-median problem [16], where it is im- 
portant to fully fund the facilities. Our second comment concerns a somewhat 
displeasing aspect of the algorithm, namely, that facilities first open and then 
may close, and cities only get connected at the very end. This is so because Jain 
and Vazirani apparently modeled their algorithm on the dual ascent /reverse 
delete pattern common to many primal-dual algorithms, but again, it is not 
fundamental aspect of the algorithm. A perhaps more appealing variant of the 
algorithm exists, which constructs a solution in a single pass by opening facilities 
and connecting cities on the fly. We defer the details of this algorithm to the full 
version of this paper. 
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Our approach can be generalized as follows. Define a new weight function 
w$ = w/r. Now, assume that w and vfi are manipulated in such a way that (1) w 
remains non-negative; (2) the decrease in Opt dominates the decrease in w $ (S), 
where S is the solution returned by the algorithm; and (3) iu^(S) drops to zero or 
less. Then, the solution found is r-approximate. Just as with our combinatorial 
“dual fitting” framework, specific applications are more structured. For example, 
in Algorithm MUFL the cities are charged individually for the drop in Opt and 
the drop in w $ (S). 

A Primal Fitting Interpretation. Consider the following LP relaxation of 
MUFL and its dual: 

min E jec.ieF l ) x ii + Ei G F max E je c a i 

s.t. E iGF X H > 1 Vj G C S.t. atj - Pji < w(j, i) Vj ec,ie F 

Vi - Xji >0 Vj €C,i€ F Pa ^ w (*) e F 

Xji , yi > 0 Vj G C,i G F a jiPji >0 Vj G C,i € F 

We construct a dual solution (a, (3) and a fractional , infeasible, primal solution 
(ay y) as follows. We use the following terminology. A facility is open whenever 
the corresponding dual constraint is tight; a facility is reached by a city whenever 
the dual constraint corresponding to the pair is tight; a city is active whenever no 
open facility is reached by it. Starting with (a, /?) = (0, 0), we increase uniformly 
the ays corresponding to the active cities. (When a city becomes inactive, we 
stop increasing its dual variable.) As the ays increase, each /3y; responds to the 
change in ay so as to maintain feasibility, i.e., we set it to max {0, ay — w(j, i)}. 
Additionally, the ayys and y t s respond to the changes in (a,/3) so that ay; = 
(ay — f3ji) /w(j, i) and y* = EyeC We keep going until all cities become 

inactive. Then we choose a maximal independent set O' of open facilities, where 
a set of open facilities is independent if for every city j there is at most one 
facility i in the set such that /3ji > 0. We now select for each city j the nearest 
facility in 0'(breaking ties arbitrarily) and denote it i(j). We zero out all primal 
variables yi corresponding to facilities not in O' and all primal variables ay.; such 
that i ^ i(j). 

The equivalence between this description and Algorithm MUFL is straight- 
forward. At every moment ay records the amount of money directed by city j 
towards each of the facilities till that moment. For each facility i, (3ji of this 
amount was received by the facility and the remainder by the connection. The 
primal variables record the receipt of funds as well: j/; is the fraction of facil- 
ity V s opening cost received by i, and ay,; is the fraction of the connection cost 
from j to i received by the connection. Clearly, the dual solution is feasible at 
all times. Also, the choice of O' in the final step as an independent set of open 
facilities ensures that the final value of (x, y) is bounded by the final value of 
(a, pi). Finally, Lemma 5 in [16] implies that ay yy) >1/3 for all cities j. Thus 
multiplying the primal solution by 3 (and rounding down to 1) makes it feasible. 

Note that the above description is slightly different from the one given in [16]. 
Specifically, instead of defining the primal solution after the algorithm termi- 
nates, we construct it together with the dual solution. Thus during the first stage 




148 



Ari Freund and Dror Rawitz 



of the algorithm each primal variable measures the tightness of the correspond- 
ing dual constraint — its value is the ratio between the left hand side and right 
hand side of the constraint. In the standard primal-dual setting (and in the dual 
fitting method) we only consider for the solution elements whose correspond- 
ing dual constraints are tight. In the primal fitting setting we consider elements 
whose corresponding dual constraints are weakly tight, i.e., their tightness is at 
least 1/r (in the above measure), where r is the algorithm’s performance guar- 
antee. The equivalence between primal fitting and our combinatorial framework 
stems from this observation and from the equivalence between increasing dual 
variables and subtracting weight functions. 

3.2 Disk Cover 

In the design phase of cellular networks the geographical area is surveyed and 
m potential base-station locations are identified. Each base-station covers a disk 
shaped area (centered at the station) whose radius can be set arbitrarily large 
(by increasing the station’s output power). The cost of operating base station b 
with radius r is proportional to the energy measure : it is w(b) ■ r(b ) 2 , where w(b) 
is a coefficient associated with b. Given n client stations, the problem is to select 
a minimum cost set of base-stations with corresponding radii to cover all client 
stations. Henceforth we refer to the locations of the client stations (which are 
points in the plane) as points. Note that the set of radii can be discretized — there 
are at most n “interesting” radii per base-station. Therefore, no more than nm 
disks need be considered. We denote the set of these disks by D. We denote the 
center of disk d by c(d), its radius by r(d), its cost (or weight) by w(d), and the 
number of points it covers (or contains) by |d|. 

Clruzhoy developed an unpublished 9-approximation algorithm for the prob- 
lem [10]. We present her algorithm here (with permission), reformulated as Al- 
gorithm DC below. The idea is that in each iteration every active point issues a 
check in the amount of e to every disk containing it (initially all points are active) 
and when a disk becomes fully funded the points it contains become inactive. 
Thus when all points become inactive, they are all covered by fully funded disks. 
However, as usual, a point is only willing to honor the checks it has given to one 
of the disks. Therefore, we construct the solution by selecting a set of disks such 
that at least one ninth of the cost of each disk can be ascribed to payments made 
by the points, such that no point contributes to more than one disk. This is done 
by the following procedure. Let X be the set of all fully funded disks. Iteratively 
delete from X the disk d* with maximum radius and every disk intersecting it, 
and add to the solution the smallest disk d' (which need not be in X) centered 
at c(d*) that covers all of the points contained in those deleted disks. Note that 
r(d') < 3 r(d*) since d* is of maximum radius in X, and thus w(d') < 9 w(d*). 
Therefore, since d* is fully funded by the points contained in it, d! is at least 
one ninth funded by these same points. Since all disks intersecting d* are deleted 
from X, the points will not be charged more than once. Conceptually, then, all 
points contained in d! retract all of the checks they have issued, except that the 
points contained in d* do not retract the checks they have given d' . 
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Algorithm DC 

1. While points remain do: 

2. e <— min{w(d)/ \d\ : d € D}. 

3. Define the weight function 5 by: Vd G D, 5(d) = e ■ |d|. 

4. w i — w — 5. 

5. Delete all points covered by zero-weight disks. 

6. S<-0. 

7. A e- {d G D : w(d) = 0}. 

8. While A ^ 0 do: 

9. d* G- arg max {r(d) : d G X}. 

10. Y <— {d G X : d and d* intersect}. 

11. a •<- a \ y. 

12. Add to S the minimum radius disk centered at c(d*) 
that covers all points contained in disks d G Y. 

13. Return S. 

Define = w/9 and assume that the weight subtractions made in Line 4 are 
also made with respect to u> $ , and that the conceptual retractions are applied 
only to uA In a given iteration Opt drops by at least e times the number of 
surviving points, and as implied by our explanation of the algorithm, w®(S) 
drops by at most the same amount (when the check retractions are factored 
in). Also, Opt never drops below zero, and as implied by our explanation of 
the algorithm, w®(S) drops to zero or less (even when the check retractions are 
taken into account). Thus we obtain an approximation factor of 9. 



A Primal Fitting Interpretation. We use the set cover LP relaxation of the 
problem (disks are sets; points are elements). The dual variables increase uni- 
formly, each ceasing to increase whenever a dual constraints involving it becomes 
tight, until all variable cease to increase. At the same time each primal variable 
increases to reflect the ratio between the left hand side and right hand side of 
the corresponding dual constraint. The set A consists initially of all disks whose 
corresponding variables are 1. Our analysis above can be used to show that for 
disks added to S, the corresponding variables are set to at least 1/9, and that 
if we zero out all other primal variables, the dual objective value upper bounds 
the primal objective value. 

Acknowledgments 

We thank Reuven Bar- Yehuda and Juila Chuzhoy for helpful discussions. 

References 

1. V. Bafna, P. Berman, and T. Fujito. A 2-approximation algorithm for the undi- 
rected feedback vertex set problem. SIAM Journal on Discrete Mathematics, 
12(3/289-297, 1999. 





150 



Ari Freund and Dror Rawitz 



2. A. Bar-Noy, R. Bar- Yehuda, A. Freund, J. Naor, and B. Shieber. A unified ap- 
proach to approximating resource allocation and schedualing. Journal of the ACM, 
48(5):1069-1090, 2001. 

3. A. Bar-Noy, S. Guha, Y. Katz, J. Naor, B. Schieber, and H. Shachnai. Throughput 
maximization of real-time scheduling with batching. In 13th Annual ACM-SIAM 
Symposium on Discrete Algorithms, pages 742-751, 2002. 

4. R. Bar- Yehuda. One for the price of two: A unified approach for approximating 
covering problems. Algorithmica, 27 (2) : 131- 144, 2000. 

5. R. Bar-Yehuda and S. Even. A local-ratio theorem for approximating the weighted 
vertex cover problem. Annals of Discrete Mathematics, 25:27-46, 1985. 

6. R. Bar-Yehuda and D. Rawitz. On the equivalence between the primal-dual schema 
and the local-ratio technique. In fth International Workshop on Approximation 
Algorithms for Combinatorial Optimization Problems, volume 2129 of LNCS, pages 
24-35, 2001. 

7. A. Becker and D. Geiger. Optimization of Pearl’s method of conditioning and 
greedy-like approximation algorithms for the vertex feedback set problem. Artificial 
Intelligence, 83(1):167- 188, 1996. 

8. D. Bertsimas and C. Teo. From valid inequalities to heuristics: A unified view of 
primal-dual approximation algorithms in covering problems. Operations Research, 
46(4):503-514, 1998. 

9. F. A. Chudak, M. X. Goemans, D. S. Hochbaum, and D. P. Williamson. A primal- 
dual interpretation of recent 2-approximation algorithms for the feedback vertex 
set problem in undirected graphs. Operations Research Letters, 22:111 118, 1998. 

10. J. Chuzhoy. Private Communication. 

11. V. Chvatal. A greedy heuristic for the set-covering problem. Mathematics of 
Operations Research, 4(3):233-235, 1979. 

12. M. X. Goemans and D. P. Williamson. The primal-dual method for approximation 
algorithms and its application to network design problems. In Hochbaum [14], 
chapter 4. 

13. S. Guha and S. Khuller. Greedy strikes back: Improved facility location algorithms. 
Journal of Algorithms, 31(l):228-248, 1999. 

14. D. S. Hochbaum, editor. Approximation Algorithms for NP-Hard Problem. PWS 
Publishing Company, 1997. 

15. K. Jain, M. Mahdian, E. Markakis, A. Saberi, and V. Vazirani. A greedy facility 
location algorithm analyzed using dual- fitting with factor-revealing LP. To appear 
in Journal of the ACM, 2003. 

16. K. Jain and V. V. Vazirani. Approximation algorithms for metric facility location 
and k-median problems using the primal-dual schema and Lagrangian relaxation. 
Journal of the ACM, 48(2):274-296, 2001. 

17. L. Lovasz. On the ratio of optimal integral and fractional covers. Discreate Math- 
ematics, 13:383-390, 1975. 

18. M. Mahdian, Y. Ye, and J. Zhang. Improved approximation algorithms for metric 
facility location problems. In 5th International Workshop on Approximation Al- 
gorithms for Combinatorial Optimization Problems, volume 2462 of LNCS, pages 
229-242, 2002. 

19. D. Shmoys. Approximation algorithms for facility location problems. In 3rd Inter- 
national Workshop on Approximation Algorithms for Combinatorial Optimization 
Problems, volume 1913 of LNCS, pages 27-33, 2000. 




On the Approximability of the Minimum 
Fundamental Cycle Basis Problem 



Giulia Galbiati 1 and Edoardo Amaldi 2 

1 Dipartimento di Informatica e Sistemistica 
Universita degli Studi di Pavia, 27100 Pavia, Italy 

giulia. galbiatiOunipv . it 

2 Dipartimento di Elettronica e Informazione 
Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy 

amaldiOelet . polimi . it 



Abstract. We consider the problem of finding a fundamental cycle ba- 
sis of minimum total weight in the cycle space associated with an undi- 
rected biconnected graph G, where a nonnegative weight is assigned to 
each edge of G and the total weight of a basis is defined as the sum 
of the weights of all the cycles in the basis. Although several heuristics 
have been proposed to tackle this NP-hard problem, which has several 
interesting applications, nothing is known regarding its approximability. 
In this paper we show that this problem is MAXSNP-hard and hence 
does not admit a polynomial-time approximation scheme (PTAS) unless 
P=NP. We also derive the first upper bounds on the approximability of 
the problem for arbitrary and dense graphs. In particular, for complete 
graphs, it is approximable within 4 + e , for any e > 0. 



1 Introduction 

Let G = ( V , E) be an undirected graph without loops or multiple edges and 
with a nonnegative weight w(e) assigned to each edge e € E. Associated with G 
there is a vector space over GF( 2), called the cycle space, consisting of the edge 
incidence vectors of all cycles (including the null cycle) and of all unions of edge- 
disjoint cycles of G. If G has n vertices, rn edges and p connected components, 
the dimension of this space is v(G) = m — n + p. A basis of the cycle space is 
called a cycle basis. Since the cycle space of a graph is the direct sum of the cycle 
spaces of its biconnected components, we assume G to be biconnected, i.e., G 
contains at least two edge-disjoint paths between any pair of vertices. 

When G is connected there are special cycle bases that can be derived from 
the spanning trees of G. If T is an arbitrary spanning tree of G, by adding 
anyone of the m — n + 1 edges of G which do not belong to T, the so-called 
chords, one creates a cycle and the set of these m — n + 1 cycles form a cycle 
basis, which is referred to as fundamental cycle basis. According to [20], a cycle 
basis B = {bi, ..., b m _ n+ 1 } of G is fundamental if and only if no b, only consists 
of edges belonging to other cycles of B. 

In this work we investigate the following combinatorial optimization problem. 
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MIN-FCB: Given a weighted graph G as above, find a fundamental cycle basis B 
of minimum total weight, i.e., which minimizes W(B) = X^i" +1 W(&») where 
W(bi ) is the sum of the weights of all edges in cycle bi. 

Interesting applications of MIN-FCB arise, for instance, in the testing of 
electrical circuits [5], the generation of minimal perfect hash functions (see [8] 
and the references therein), the coding of ring compounds [21], the planning of 
complex syntheses in organic chemistry [22] as well as in cyclic timetabling [13]. 

While the problem of finding a general (not necessarily fundamental) cycle 
basis of minimum total weight is solvable in polynomial time [11], in [7] MIN- 
FCB is shown to be NP-hard. Although several heuristics have been proposed 
for its solution (see [7, 8] and references therein), to the best of our knowledge, no 
performance guarantees have been established and nothing is known regarding 
its approximability. 

In this paper we prove that MIN-FCB is MAXSNP-hard and hence does 
not admit a polynomial-time approximation scheme unless P=NP. Using a re- 
sult in [19], relying on [2], we show that, for arbitrary graphs, the problem is 
approximable within a factor of 2 c ^'/ log ™ loglogn ). For dense graphs we derive a 
constant (respectively an O(logn)) factor if the number of edges of the comple- 
ment of G is 0(1) (respectively O(logn)). In particular for complete graphs we 
have a constant factor of 4 + e, for any e > 0. 

Extensively studied problems related to MIN-FCB include the Minimum 
Communication cost spanning Tree problem (MCT), introduced in [12], and the 
Minimum Routing cost spanning Tree problem (MRT), see e.g. [23]. In MCT, 
given a complete undirected graph G with a nonnegative weight w(e) assigned to 
each edge e £ E and a nonnegative communication requirement r(i,j) for each 
pair of vertices i, j, one looks for a spanning tree T of G which minimizes the total 
communication cost, i.e., the function comc(T) = JA j eV r(i, j)wT{i, j), where 
Wr(i,j) denotes the weight of the unique path in T joining the vertices i and 
j. MCT is known to be approximable within 2°('/ logTlloglog ™) even for graphs 
that are not complete [19]. In [23] it is shown that MCT with all requirements 
equal to 1, referred to as MRT, admits instead a polynomial-time approximation 
scheme even for graphs that are not complete. Although MIN-FCB was shown 
to be NP-hard by reduction from MRT [7] , the two problems differ substantially. 
While in MRT one minimizes the sum of the weight of the paths on the tree 
between all pairs of vertices, in MIN-FCB the sum is taken only over the pairs 
of vertices corresponding to the edges of the graph that do not belong to the 
tree and the weights of all these non-tree edges are also included in the objective 
function. 

The approximability of the bottleneck version of MIN-FCB, in which one 
looks for a fundamental cycle basis where the weight of the maximum cycle is 
minimum, has been recently addressed in [10]. 

In Section 2 we establish the main inapproximability result, while in Section 3 
we derive the above-mentioned approximability upper bounds. 
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2 Inapproximability Result 

To establish MAXSNP-hardness of MIN-FCB we proceed by L-reduction from 
the following version of the well-known maximum satisfiability problem for 
Boolean formulas, which is MAXSNP-complete [15]. 

MAX-2-SAT-3-B: Given a set U = {zq, u v } of Boolean variables and a 
collection C = {Ci, ...,C c } of disjunctive clauses with exactly 2 literals (i.e., 
a variable or its complement) per clause and with each literal occurring in C 
at most 3 times, find an assignment of Boolean values to the variables which 
satisfies as many clauses as possible. 

Since L-reductions preserve approximability within constant factors, an L- 
reduction from a MAXSNP-complete problem to MIN-FCB implies that the 
latter problem is MAXSNP-lrard. According to [1], MAXSNP-lrard problems 
do not admit a PTAS, i.e., they cannot be appproximatecl within every con- 
stant a > 1, unless P=NP. The reader is referred to [3, 15] and to the proof of 
Theorem 2 for the definition of L-reduction and for its use in this context. 




Fig. 1. Graph G' 



2.1 The Reduction 

For any given instance I of MAX-2-S AT-3-B, we construct a particular instance 
/' of MIN-FCB, i.e., we define a particular weighted biconnected graph G. 

We start the construction of G from the graph G given in Figure 1, where the 
weight assigned to each edge is indicated and the constants e, k and M will be 
defined appropriately in (1) and (2). In order to obtain G from G , for each clause 
Ci £ C we add to G two vertices c* and c l and the edge {c,, c 1 } with weight equal 
to M . Moreover, if Ci contains the variable Uj or its complement Uj, we add the 
edge {c l ,Vj} with weight equal to c— he. Finally, if the variable Uj (respectively 
Uj ) occurs in clause Ci, we add the edge {ci,Uj} (respectively {ci,Uj}) with 
weight equal to c. The additional vertices and edges for an arbitrary clause Ci 
are shown in Figure 2, where only one of the dashed edges is actually included. 

It is easy to verify that the resulting graph G is biconnected, with n = 
6v + 2c + 1, m = 8v + 5c so that v(G) = 2v + 3c. 
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Fig. 2. Additional vertices and edges for each clause Ci 




w 



Fig. 3. Tree T' 



To complete the reduction description, we need to assign values to the pa- 
rameters k, e and M . Specific values, which will yield an L-reduction (as we shall 
see in Subsection 2.3), are: 



with a being a sufficiently small fixed constant as, for instance, a < 2/7. 

2.2 Well-Behaved and Quasi- Well-Behaved Trees 

We now investigate the structure of some spanning trees of the graphs G con- 
structed from the instances of MAX-2-S AT-3-B via the reduction. The purpose 
of this investigation is to prove Theorem 1 which plays a key role in the next 
subsection, where we show that the reduction is an L-reduction. 

Definition 1. A spanning tree T of G is well behaved if it satisfies the following 
properties: 

— it contains all the edges of tree T' in Figure 3; 

— for each variable Uj, j = it includes exactly one of the two edges 

{vj,uj} or {vj,Uj}; 

— for each clause Ci, i = 1 , ..., c, it includes {c l , Vj } , for some j = 1 , ..., v, and 
either {ci,Uj} or {ci,Uj}, depending on whether Ci contains the literal Uj or 




(1) 

(2) 



M = 2 c{v + c) + 2ve(k + 1) + 2c(2c + 2e) + ca + 1 
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First we notice that by definition a well-behaved tree T does not include 
edges of weight M. Moreover, for each clause C* the corresponding edge {ci,c 1 } 
belongs to a cycle in G of weight M + 2c or M + 2c + 2e, so that the number c of 
clauses can be partitioned into two sets of cardinality c y and c n , depending on 
whether {d, c 1 } belongs to a cycle of the first or second type. Clearly c = c y +c n 
and it is not difficult but somewhat lengthy to verify that the total weight of the 
fundamental basis associated to a well-behaved tree T, denoted by fundc(T), 
satisfies 

fundc{T) = F + 2ec n (3) 

with F = (M + 2c) (u + c) + 2 ve(k + 1) + 2c(2c + 2e). 

Then we observe that from any well-behaved tree T of G it is possible to 
derive a truth assignment for I that satisfies either the c y or the c n clauses. 
Indeed, we can satisfy the first (respectively second) type of clauses by setting 
to 1 (respectively 0) all variables Uj for which the tree T includes the edge {vj, Uj} 
and all negated variables Uj for which the tree T includes the edge {vj,Uj}. 

Finally, note that if an instance I of MAX-2-SAT-3-B is satisfiable then 
it is easy to construct a well-behaved tree T having fundc(T ) exactly equal to 
F. We will prove that this tree has a set of fundamental cycles that forms a 
minimum fundamental cycle basis. If I is not satisfiable, then we will show that 
any minimum fundamental cycle basis for the cycle space of G has a total weight 
differing from A by a value which allows us to measure the minimum number 
of unsatisfiable clauses in I. 

We can now state the key result. 

Theorem 1 . Every spanning tree T of G satisfying the inequality 

fundc(T ) < F + ca , (4) 

with F, c and a as above, either is well-behaved or it can be transformed into a 
well-behaved tree T with fundciT ) < fundciT). 

A spanning tree T of G satisfying (4) can thus be viewed as a quasi-well- 
behaved spanning tree. From now on let B = {£>i , ...,b m - n + 1 } be the set of the 
fundamental cycles of such a quasi-well-belraved tree T of G. 

The proof of Theorem 1 relies on the following series of properties. 
Proposition 1 . The following holds: 

i) Each edge of G with weight equal to M belongs to only one fundamental cycle 
in B. 

ii) Each fundamental cycle of T having at least one edge of weight M has one 
of these edges as a chord of T. 

Hi) There are in B at most v fundamental cycles having weight less than 1; the 
weight of any other cycle in B is at least 2c + 2e. 
iv) If a cycle b in B includes l edges of weight M then W(b ) > l(M + 2 c — he). 




156 



Giulia Galbiati and Edoardo Amaldi 



Proof, (z) holds because the value assigned to M in (2) makes M{c + v + 1) > 
F + ca , and {ii) follows from (z). Point {Hi) simply follows by looking at G. To 
prove (iv) it is enough to observe that in order to include in a cycle an edge 
{c®, c,;}, two edges respectively of weight c — ke and c must be used; to include 
instead an edge {xj,Zj} two edges of weight c — e must be used and k > 2. 

Lemma 1. The following holds: 

i) the number of cycles in B that include an edge of weight M is exactly c + v; 
ii) the number of cycles in B of weight less than 1 is exactly v. 

Proof. We prove (i). Because of Proposition 1 (z) this number cannot be larger 
than c + v. Suppose it is less than c + v, and equal to c + v — A for some integer 
A>1. We derive a lower bound on W{B) larger than F + ca, thus contradicting 
(4). In fact the sum of the weights of the cycles containing an edge of weight M 
is at least (c + v) ( M + 2c — ke) because of Proposition 1 {iv); moreover there are 
at most v cycles of weight less than 1 and the remaining n c cycles have weight at 
least 2c+2e because of Proposition 1 {Hi). Hence we have: W{B) > {c+v){M + 
2c — ke) + 2 ve{k+ 1) + (2c + 2e)n c with n c = v{G) — u — v — c + A = 2c + A The 
lower bound for W{B) is equal to F — ke{c+v) + A{2c+2e) which is not less than 
F — ke{c+v) + {2c+2e). It is easy to verify that F — ke{c+v) + {2c+2e) > F+ca 
if (1) holds, being v < 2c. The proof of (ii) follows similarly. 

Lemma 2. Every cycle b in B contains at most two edges having weight in [1, c]. 

Proof. Suppose that b has more than two of these edges. We distinguish the 
cases where b contains no edges of weight M or it has exactly one of these edges. 
In the first case we have W{b) > 3c— 1 and therefore, similarly as in the previous 
lemma, we can write: 

W{B) > {3c-l) + {c+v){M+2c) + 2ve{k + l) + {2c+2e){2c-l) = F+c- l-2e. 
In the second case we have W{b) > 3c — 1 + M and we can write: 

W{B) > 3c—l + M+{c+v— 1)(M + 2c) + 2ve{k + l) + {2c + 2e)2c = F+c—1. 
In both cases it is easy to see that, since (1) makes true the inequality F + c — 
1 — 2 e > F + ca, we get the contradiction W{B) > F + ca. Indeed, if e < f then 
1 — < 1 — 1+2e ; since c > 3, any a such that a < 1 — . like the chosen 

a < satisfies the above inequality. 

Lemma 3. Every cycle b in B that contains an edge {c* , c 1 }, for some i = 1, ..., c, 
and an edge {c l ,Vj}, for some j = 1, either it contains {ci,Uj} or {ci,Uj}, 
depending on whether Uj or iij occurs in clause Ci, or it can be replaced in B by 
another cycle having this property without increasing W {B) . 

Proof. Suppose that Uj (respectively iij ) is in clause G t but edge { Ci,Uj } (re- 
spectively \ci, M;})is not in cycle b; call this edge a. Because of Lemma 2 the 
two cases we need to consider are illustrated in Figure 4 where all edges in cycle 
b are drawn in heavy lines but the one belonging to the chords of T which is 
drawn with a light line, as the other edges of G that are known not to belong 
to T. 
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c* c* 





Fig. 4. The cases considered in the proof of Lemma 6 



Case 1. Using Lemma 2 and the fact that every clause in instance / has two 
literals we may conclude that d belongs exactly to two cycles in B , having as 
chords respectively / and a. It is easy to verify that the insertion into T of edge 
a and the removal of edge d decreases W(B) by 2e. 

Case 2. Two subcases need to be distinguished, which are are illustrated in 
Figure 5. 





Fig. 5. The case 2 subcases 



Subcase 2. a. One of the edges g and h is in T. It is easy to verify that d belongs 
exactly to two cycles in B and that the insertion into T of edge a and the removal 
of edge d does not increase W ( B ) . 

Subcase 2.b. None of the edges g and h is in T. Consider the path in T from 
node u to the endvertex Uj or Uj of edge a and let e be the last edge in the path. 
Since chord a would identify a cycle in B contradicting Lemma 2 we conclude 
that this case cannot happen. 
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Corollary 1. Every cycle in B that contains an edge { Cj,c *}, for some i = 
1, c, either has weight equal to M + 2c or equal to M + 2c + 2e. 

Lemma 4. Every cycle b in B that contains an edge {xj,Zj}, for some j = 
either it contains the four edges {xj,Uj}, {uj,yj}, {yj, Uj}, {uj,Zj} or 
it can be replaced in B by another cycle having this property without increasing 
W(B). 

Proof. Because of Lemma 2 the cases to be considered are illustrated in Figure 6, 
where cycle b contains the edges {vj,Uj},{vj,Uj} and either (Case 1) T includes 
one of the edges {uj, y-j}, {yj,Uj} (say [iij, y.j}) or (Case 2) none of them. 





Fig. 6. The cases considered in the proof of Lemma 8 



Let us first examine Case 2. Here tree T must include {iv,yj}. Consider the 
path in T from iv to Uj. It must go through c l or Cj, for some i: hence tree T 
must include two edges incident to c* or c*. Since {cj,c*} is not in T, another 
edge of the four edges incident to c l and c, and of weight less than M must be 
in T, leaving the forth edge able to identify a cycle contraddicting Lemma 2. 
Hence this case cannot happen. 

Now we examine Case 1. The cycles containing edge a, because of Lemma 3, 
only include: 

— the cycle with chord {xj,Zj}, of weight M + 2c + 2 ke — 2e; 

— the cycle with chord d, of weight 2e + 2 fee; 

— cycles with chords {c*,Cj}, for some i, of weight M + 2c; 

— cycles with chords {c l , c^}, for some i, of weight M + 2c + 2e\ 

— cycles that use {yj, uj} and go through some c 1 , of weight 2c+2e (see Figure 
7.(a)); 

— cycles that use {yj, tv} and go through some c,, of weight 2c+2e + 2 ke (see 
Figure 7.(b)); 

If in T we add edge d and remove edge a then the weights of the cycles in the 
list above become, respectively, M+2c, 2e+2 ke, M+2c+2e, M +2c+2e, 2c+2e, 
2c + 2e. Hence only the weights of the cycles in the third group increase but the 
number of these cycles is bounded above by 3. If we require that the maximum 
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Fig. 7. The case 1 subcases 



increase is not larger than a minimum decrease, i.e., that 2e 3 < 2 ke — 2 e and 
hence k > 3+1 like in (1), the conclusion follows. 



Proof of Theorem 1. We first transform T to satisfy the properties required 
by Lemma 3 and Lemma 4. Then we observe that all edges in tree T of Figure 
3, but the {w, yj} for all j = 1, ..., v, must be in T, because of Proposition 1 and 
Lemma 4. Using Corollary 1, Lemma 3 and a reasoning similar to the one used 
to prove Case 2 in Lemma 4, we conclude that edges {w, y 3 } , for all j, are in T. 
Hence tree T of Figure 3 is in T. Finally, Lemma 3 and Corollary 1 imply that 
tree T also satisfies the second and the third properties of a well-behaved tree. 



2.3 MAXSNP-Hardness 

We now prove that the polynomial-time reduction given in Subsection 2.1 yields 
an L-reduction and hence that MIN-FCB is MAXSNP-hard. We start with the 
following result. 

Lemma 5. An instance I of MAX-2-SAT-3-B has at most rj unsatisfiable 
clauses iff there exists in the associated graph G a spanning tree T with 
funda(T ) < F + 2 erj, where F = ( M + 2c) (v + c) + 2 ve[k + 1) + 2c(2c+ 2 e). 

Proof. If an instance I has rj unsatisfiable clauses it is easy to find the required 
spanning tree. Conversely, suppose that a spanning tree T has fundfj(T) < 
F + 2 eij. If y > c then the conclusion is obvious since every instance has at most 
c unsatisfied clauses. If instead rj < c, since e < a/2, we have fundc(T) < F +ca 
and Theorem 1 applies. For the well-behaved tree T that exists according to 
the theorem, we know from (3) that fundcj (T ) = F + 2 ec n < F + 2ey so that 
the number of unsatisfied clauses in / is at most r). 




160 



Giulia Galbiati and Edoardo Amaldi 



Thus there exists a simple relation between the maximum number of satis- 
fiable clauses in instance /, denoted by opt(I), and the minimum total weight 
of a fundamental cycle basis of instance /', denoted by opt(I'). In particular, if 
opt(I) = c* then opt(I') = F + 2e(c — c*). 

Theorem 2. MIN-FCB is MAXSNP -hard and therefore it does not admit a 
polynomial-time approximation scheme, unless P=NP. 

Proof. It suffices to exhibit an L-reduction (ti, f 2 , Pi, Pf) from MAX-2-SAT-3- 
B to MIN-FCB, where, as defined in [3], t\ and i 2 are two appropriate polyno- 
mially computable functions and (3 1 and /? 2 are two positive constants. As t\ we 
use the reduction described in Subsection 2.2, which associates to any instance 
/ of MAX-2-SAT-3-B a particular instance /' of MIN-FCB. The function t 2 , 
which associates to any instance I of MAX-2-SAT-3-B and any spanning tree 
T of the corresponding instance /' of MIN-FCB a feasible solution f 2 (/, T) of 
I, is defined depending on the value of f unda(T). If fundc (T) < F + ca then 
t 2 (J, T) is the truth assignment, which is derivable from the well-behaved tree 
T' that exists according to Theorem 1, satisfying the c v clauses of I as explained 
in Subsection 2.2. Otherwise t 2 (/,T) is any truth assignment for I that satisfies 
more than c/2 clauses. 

The two following inequalities need to be verified: 

1. opt(I') < P 1 opt(I) 

2. \opt(I) - val(I,t 2 (I,T))\ < P 2 | fund G (T) - opt(I')\, 

where for any given truth assignment r for instance I , val(I,r ) denotes the 
number of satisfied clauses. 

The first inequality follows directly from the fact that if opt {I) = c* then 
opt(I') = F + 2 e(c — c*). Indeed k, e , a are positive fixed constants and v < 2c 
because each clause contains two variables and c < 2 opt(I) since at least half of 
the clauses of any instance can always be satisfied. 

To verify the second inequality we proceed as follows. If funda(T) < F +ca 
and T' is defined as above we have that fund(T') = F + 2 ec„ and c y = c — c n . 
Hence we have that \fundc(T) — opt(I')\ > \fundG(T') — opt(I')\ = 2e\c y — c*| = 
2e\opt{I) — val(I ,t 2 (I ,T))\ and 2. holds with /3 2 = A. 

If fundc;{T) > F + ca the definition of function f 2 implies in this case 
| optfl) — val(I, t 2 (/, T))| < §. We derive easily that | = ^(src) < ^(2ec*) < 
7^(2 ec* + ca — 2 ec) = jz (ca — 2.ec^). From the lower bound on fundc(T) it 
follows that | fundo{T) — opt(I') \ > ca — 2 ec* which concludes the proof. 

3 Approximability Results 

We now turn to some upper bounds on the approximability of MIN-FCB that 
derive from recent results regarding related problems. We first consider the gen- 
eral case of arbitrary weighted graphs. 

Proposition 2. MIN-FCB is approximahle within 2°^ lognlos lo s n ). 
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Proof. When the problem is restricted to instances with all weights equal to 1, 
the above approximability bound is a straighforwarcl consequence of the main 
result in [2], In the general case of arbitrary weighted graphs, we proceed by 
reduction from MIN-FCB to MCT and we use the polynomial-time approxi- 
mation algorithm for MCT given in [19]. Let G = (V,E) be the graph of an 
arbitrary instance of MIN-FCB. For each pair (i.j) of vertices of G, a require- 
ment r(i,j) is specified as follows: r(i,j) = 1 if i and j are the end vertices 
of an edge of E and r(i,j) = 0 otherwise. Let G be the graph with the added 
requirements, which is a particular instance of MCT on graphs that are not 
necessarily complete. For any subset S C E, let W(S) = X^ee-E w ( e )- 

For any spanning tree T (of G and also of G ) the functions fund G {T) and 
com G ' (' T ) are well defined and it is easy to verify that they satisfy: 

com. G ' (T) = fund G (T) - W{E) + 2 W(T). (5) 

Now, let T* (respectively by Ttf ) be a spanning tree that minimizes the 
function com G > (T) (respectively fund G (T)). Since com G >(T*) < com G '(TJ) 
and for any subset S C E we have W(S) < fund G (TJ) relation (5) directly 
implies the following. 

Claim 1 fund G {Tf) < 3 fund G {TJ). 

Claim 2 If MCT with requirements in {0, 1} is approximable within p > 1, 
then MIN-FCB is approximable within 3 p+ 1. 

Let T' c be a spanning tree of G that solves MCT within p, i.e., com G > (Tf) < p 
com G ' (T*). We derive that: 

fund G (T' c ) = com G > (T') + W(E) - 2 W(T') < p com G > (T*) + W(E) - 2 W{T' C ) 

< p com G ’ ( T* f ) + W(E) - 2 W{T' C ) 

< p ( fund G (T •}) - W(E) + 2 W(Tf)) + W(E) - 2 W{T' C ) 

< P fund G (T] ) + 2 pW{T}) + W(E) < (3p + l)fund G {T}) 

and this shows that T' c is a 3p+ 1-approximate solution of MIN-FCB. 

Since in [19] it is proved that MCT with arbitrary graphs and arbitrary re- 
quirements is approximable within 2 c ’(' /log " log lo s ra ) ? the conclusion follows from 
Claim 2. 

Better upper bounds on the approximability of MIN-FCB can be derived 
for dense graphs. 

Proposition 3. MIN-FCB is approximable within 0(1) if ttil = 0(1) and 
within O(lnn) if = O(lgn), where niL denotes the number of edges in the 
complement of the graph G. 

Proof. The proof is similar to that of Proposition 2. Instead of the polynomial- 
time approximation algorithm for MCT with arbitrary graphs and requirements, 
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we use here the polynomial-time approximation scheme for MCT with arbitrary 
graphs but all requirements equal to 1 (MRT), presented in [23]. Therefore, given 
the graph G of an arbitrary instance of MIN-FCB, we set all requirements r(i,j) 
in G equal to 1. The analogue of (5) now becomes: 

com G > (T) = fund G (T) -W(E) + 2 W(T) + ^ w T (i,j), (6) 

{iJHE 



where Wr(i,j ) denotes the weight of the unique path in tree T joining vertices 
i and j. According to the analogue of Claim 1 we have fundc (T* ) < (3 + 
niL) furidc(Tj '). According to the analogue of Claim 2, if MCT with uniform 
requirements is approximable within p then MIN-FCB is approximable within 
p(3 + tol) + 1. Since in [23] it is proved that MRT is approximable within (1 + e), 
for every £ > 0, then MIN-FCB is approximable within (1 + s)(3 + tol) + 1, 
and the conclusions follow both for niL = 0(1) and niL = O(lgn). 

Proposition 3 implies therefore that MIN-FCB, restricted to complete graphs, 
is approximable within a constant factor of 4 + £, for any e > 0, since uil — 0. 
This is in particular true for metric graphs, which are complete and whose edge 
weights satisfy the triangle inequality. 

The question naturally arises as to whether the recent results on the approx- 
imability of finite metrics by tree metrics (see e.g. [4,6,14,9]) imply stronger 
approximability upper bounds for MIN-FCB. In particular in [9] it is shown 
that given any metric graph M = (V. E) with additional “weights” r(i,j) on the 
edges of M, one can find in polynomial-time a single tree T such that for all i,j 
in V, we have dM{i,j) < dr{i,j) and 

r (bj) d T(hj) < O(logn) ^2 r(i,j ) d M (i,j ) (7) 

i,jev i,j£V 

where dM(i,j) denotes the length of a shortest path between nodes i and j in 
graph M and dx{i,j) the length of the unique path between i and j in tree T. 
Although the above problem is closely related to MIN-FCB, this result does 
not imply a better approximation factor for MIN-FCB since the tree found is 
not guaranteed to be a spanning tree of the original graph G. This property is 
clearly true for metric graphs which are complete. But in this case the above- 
mentioned result implies a logarithmic approximation factor which is weaker 
than the constant factor of Proposition 3. 



4 Concluding Remarks 

We have presented the first results regarding the approximability of the problem 
of finding fundamental cycle bases of minimum total weight in undirected bicon- 
nected graphs. On the one hand, we have proved that MIN-FCB is MAXSNP- 
lrard and hence does not admit a polynomial-time approximation scheme, unless 
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P=NP. Thus MIN-FCB turns out to be harder to approximate than the re- 
lated problem MRT [23]. On the other hand, we have derived the first upper 
bounds on its actual approximability factor for arbirary graphs as well as for 
dense graphs. We leave as an open question whether MIN-FCB is approximable 
within a logarithmic factor for arbitrary graphs. 
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Abstract. We consider the problem of escaping from an unknown polyg- 
onal maze under limited resources and under errors in sensors and mo- 
tion. It is well-known that the pledge algorithm always finds a path out 
of an unknown maze without any means of orientation, provided that 
such a path exists. The pledge algorithm sums up the turning angles 
along the boundary of the obstacles and plans its way by using a single 
counter and no further information. The correctness proof makes use of 
the fact that motions in the free space and the measurement of turning 
angles can be done exactly. We consider the case that the free motion of 
the robot and the computation of turning angles are erroneous. 

This work is a first attempt to incorporate error assumptions into theo- 
retically sound proofs of online motion planning algorithms. 



1 Introduction 

Online motion planning in unknown environments is theoretically well-under- 
stood and practically solved in many settings. During the last decade many 
different objectives where discussed under several robot models. 

For instance, the task of searching for a target using a touch sensor and a 
compass to the goal was first considered by Lumelsky and Stepanov [12]. They 
invented the BUG algorithm and many variants of this algorithm were discussed 
and analyzed afterwards, for example see Rajko and La Valle [14] and Kamon 
and Rivlin [10]. A BUG-like algorithm was also used for the Mars-Rover project, 
see Laubach [11]. The correctness of the algorithms are proven under idealistic 
assumptions whereas the practical relevance is always tested empirically. 

Moreover, the task of exploring an unknown environment has attracted the- 
oretical and practical attention. Deng et. al. [4] studied the case of a robot 
equipped with a vision system that maps an unknown, simple, rectangular en- 
vironment. They developed a strategy and prove that under correct vision and 
motion the corresponding path of the robot is never longer than \/2 times the 
optimal path computed under full information in advance. The strategy is called 
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\[2 -competitive in this model. Under the same idealistic assumptions Hoffmann 
et. al. [8] presented and analyzed an 26.5-competitive algorithm for exploring the 
interior of a simple polygon. On the other hand the task of building a map of 
an unknown environment is solved practically and analyzed empirically in the 
field of robotics, see e. g. Batalin and Sukhatme [2] or Yamauchi et. al. [17]. For a 
general overview of theoretical online motion planning problems see the surveys 
[15,16,3,13,9], 

Theoretical correctness results and performance guarantees from labyrinth 
theory or computational geometry suffer from idealistic assumptions, therefore 
in the worst case a correct implementation is impossible. On the other hand 
practioners analyze correctness results and performance guarantees mainly sta- 
tistically. So it might be useful to investigate, how online algorithms with idealis- 
tic assumptions behave, if those assumptions cannot be fulfilled, more precisely, 
can we incorporate assumptions of errors in sensors and motion directly into the 
theoretical analysis? 

As a first approach we consider a simple and theoretically sound online strat- 
egy, the well-known pledge algorithm, see Abelson and diSessa [1] and Hemmer- 
ling [7]. Given an unknown polygonal maze, the robot has to leave the maze, 
using nothing else than a touch sensor, the ability to measure its turning angles 
and a very limited amount of memory. The robot is not able to create a map or 
to set landmarks. The pledge algorithm assumes that the robot is able to move 
a straight line between the obstacles and to count its turning angles correctly. 
Gritzmann [5] remarked that it would be interesting to know how the pledge al- 
gorithm behaves, if those assumptions cannot be fulfilled. Of course, if the robot 
can make arbitrary big mistakes, there are always environments, in which the 
robot is hopelessly trapped. But what if the robot’s errors are small enough? 
Are there upper bounds for measuring errors? We investigate which conditions 
must hold to ensure that a robot can leave an unknown maze with a pledge-like 
algorithm. 

The paper is organized as follows. In Sect. 2 we specify the robot’s task 
and the model of the robot and its environment. We recapitulate the pledge 
algorithm, and analyze the sources for errors in this algorithm. With this insight, 
in Sect. 3 we define a class of curves in the robot’s environment that fulfill a set 
of conditions. Further we show that a robot will escape from an unknown maze, 
if its path somehow follows a curve from that class. Last, in Sect. 5 we discuss 
the impact of our results on the design of a robot. Our main result is, that a 
robot equipped with a compass with reasonable small errors is able to leave an 
unknown maze using the pledge algorithm; see Sect. 4.1. Further we show results 
for a robot with exact free motion and bounded angle measuring error and for 
a robot in an ‘almost rectangular’ environment. 

2 Preliminaries 

Let us assume that a maze with polygonal shaped obstacles in the plane is given. 
The robot is able to recognize and follow a wall in a specified direction (w. 1. o. g. 
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counter-clockwise) and to count the turning angles. How these abilities can be 
realized depends on the hardware of a specific robot. For example, the turning 
angles may be counted by odometry or by measuring angles along the walls with 
sensors. Other abilities for orientation and navigation are not required, especially 
it is not necessary that the robot can build a map of its environment. 

The task of leaving an unknown maze can be solved using the well-known 
pledge algorithm, see algorithm 1 , which performs only two types of movements: 
Either the robot follows the wall of an obstacle and counts the turning angles, or 
the robot moves through the free space between the obstacles in a fixed direction. 
The latter task always starts at vertices of the obstacles when the angle-counter 
reaches a pre-defined value. As soon as the robot leaves the maze it recieves a 
signal of success. 

Algorithm 1: Pledge 
REPEAT 

U! = 0 

REPEAT 

Move in direction to in the free space 

UNTIL Robot hits an obstacle 

REPEAT 

Follow the wall in counter-clockwise direction 

Count the overall turning angle in u 

UNTIL Angle Counter u> = 0 
UNTIL Robot is outside the maze 



Both types of movements in the pledge algorithm may be afflicted with errors. 
Either the turning angles are not measured exactly and the robot leaves the 
obstacle earlier or later than expected, or the robot cannot follow its initial 
direction during the movement in the free space. In the idealised setting the 
robot is error-free and it was shown by Abelson and diSessa [1] and Hemmerling 
[7] that in this case a robot will escape from a polygonal maze by performing 
the pledge algorithm provided that there is such a solution. An example of the 
robot’s path using an error-free pledge algorithm is given in Fig. 1. The angle 
counting technique is illustrated for the second obstacle: After the robot hits the 
obstacle in p 2 it turns about — f to follow the wall. In p 3 the robot turns about 
+ j to follow the next wall. Finally, in P 4 it turns about again until the angle 
counter reaches zero and the robot leaves the obstacle. Observe that the robot 
does not leave the obstacle in p 1} since its angle counter is —2tt instead of zero. 

In the following we define a class K, of curves in the robot’s workspace. The 
curves in /C represent possible paths that lead to an exit, even if the robot’s 
sensors and motions are erroneous. For convenience, we assume that the robot 
is point-shaped, but this is no restriction as long as the robot is smaller than 
the distances between obstacles. So the parts of a curve that map to a move- 
ment along a wall, are line segments on the boundaries of obstacles. To escape 
from an unknown maze, the robot’s strategy is not required to calculate a path 
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Fig. 1 . The path of the pledge algorithm. 



that matches a curve in K, exactly, rather it is sufficient that the robot’s path 
orientates essentially at a curve in /C. For example the robot may follow a wall 
in a certain distance, because it is not point-shaped, or it may follow a wall in a 
zig-zag manner. See Sect. 4 for more details. 

Since every point on the curve represents a position as well as a heading, 
the curve is a subspace of the workspace C = 1R x IR x 1R and a point will be 
described as C(t) = (P(t),ip(t)), where P(t) = (. X(t),Y(t )) denotes the position 
at time t and ip(t) the heading. Note, that ip(t) is the sum of all turns the curve 
has made so far, so if the curve has made two full turns counterclockwise then 
<p(t) equals 47r instead of zero. 

In order to classify the possible positions in the workspace, we divide the 
space of positions, P = IR x IR, into three subspaces: First, the space of forbidden 
configurations, Cf or b, the union of the interior of all obstacles. Second, the space of 
half-free configurations, Chaif, that is the union of the boundaries of the obstacles, 
and last the free configurations, Cf ree , where P(t) ^ Pi for all obstacles Pi holds. 

If a curve hits a point p = P{H Z ) in Chaif after a movement through Cf ree , 
we will call H, a hit point. If the curve leaves Chaif and enters the free space at 
q = P{Li ), we call Li a leave point. With respect to the pledge algorithm we 
assume that every leave point belongs to a vertex of an obstacle. 

3 Sufficient Conditions 

Let us assume that it is possible to leave a given unknown maze. Within this 
section we establish a set of sufficient conditions for escaping from the maze. The 
robot follows- somehow — a curve C and to ensure that the robot can escape we 
want to avoid infinite cycles in the curve. 

There are mainly two sources of errors in the pledge algorithm: one concerns 
the motion in the free space and the other the motion along the boundaries of 
the obstacles. The first type of error occurs, if the robot is not able to move a 
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straight line in the free space, the latter, if the robot’s angle counter is in some 
way inaccurate. Both types lead to a set of sufficient conditions that ensures the 
correctness of an error-afflicted pledge algorithm. 




To establish the set of conditions, we first observe that already small counting 
errors along the boundary of obstacles or deviations in the free space can sum 
up to a big mistake and lead to an infinite cycle. Figure 2 shows an example: 
The dashed lines show the correct leaving direction with respect to the direction 
in the hit point. Between the obstacles the robot drifts a little bit away from the 
correct direction and ends up in an infinite loop. This would happen even if the 
curve would not be allowed to make a full 27r turn in the free space. Obviously, 
cycles would be inhibited, if the curve between two obstacles would stay in a 
wedge around the initial direction. In fact, this wedge can be as large as a half 
space! This leads to the condition that for any two points in the free space, the 
difference in the headings should be lower than tt. We will refer to this as the 
free space condition. 

Unfortunately the free space condition is not sufficient. Figure 3 shows two 
examples, where C has a cycle, although the free space condition is fulfilled. 
In both cases the curve starts in s, meets an obstacle in Hi, misses the first 
possible leave point p and leaves the obstacle at another vertex q. The curve in 
Fig. 3(i) hits the same obstacle again in H k . In Fig. 3(ii) the curve hits another 
obstacle, leaves this one in L k and hits the first obstacle again in Hk . In both 
cases P(Hk) is visited two times, at Hk and tk . In the first case, the heading 
in tk is slightly larger than tt, in the second case +f. Observe that the curve 
in Fig. 3(ii) represents a second mistake: the heading in the hit point H k is not 
zero, as it should be according to the pledge algorithm. This may occur if the 
last obstacle was left too early or the path between the obstacles is not a straight 
line segment or this may be a combination of both reasons. However, the heading 
in Hk is — | £ and both mistakes sum up to an error that is slightly larger 

than tt, too. 

Note, that the problem is not related to a second visit of a single point. The 
curve of the error-free pledge algorithm, may have many self-hits. It is rather 
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Fig. 4. The difference between a crossing (i) and a touch (ii) at £ 2 - 



the fact that the heading in £ k is quasi overwinded in terms of the heading in 
the hit point. 

Those observations lead to the conjecture that whenever the curve hits an 
obstacle at and there exists a point tk with P(Hk) = P(£fc), then ip(tk) — 
tp(Hk) < 7T should hold. We will refer to this as the obstacle condition. In the 
rest of this chapter, we will show that both conditions are sufficient. 

Definition 1 . Let K, be the class of curves in Cf ree LiCh a if that satisfy the following 
conditions: 

(i) The curve circles an obstacle in a counter-clockwise direction. 

(ii) Every leave point belongs to a vertex of an obstacle. 

(Hi) Vf i , t‘2 6 C : P(t \) , Pftf) € C/ re e => |v 5 (^i) — ¥>(£2)1 < ^ (Free space condi- 
tion). 

(iv) VHi,t € C : P(t) = P{Hi) => tp(t) — ip(Hi) < n (Obstacle condition). 

Obviously, the curve of the error-free pledge algorithm is a curve in 1 C. The 
curves in /C have two important properties that will be shown in the following 
two lemmata. 

Lemma 2. A curve C £ K. cannot cross itself. 

Note, that a curve of /C can touch itself, see Fig. 4. 

Proof. Let us assume, C crosses itself. Consider the first crossing of C, i. e., there 
are two parameters £1 and £2 with t\ < £2 and P(£i) = P(t 2), where a crossing 
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Fig. 5. A counterclockwise turn and a crossing respectively no crossing. 




Fig. 6. A clockwise turn and a crossing respectively no crossing. 

occurs in P(t 2 ) and no crossings exist before t 2 . The curve C will make either 
a counterclockwise or a clockwise loop between t\ and t 2 . If this first crossing 
happens in the free space, the curve will violate the free space condition, so the 
first crossing could occur only in Chaif- 

Let us consider the case of a counterclockwise loop, see Fig. 5(i). The curve 
hits an obstacle at Hi, makes a counterclockwise turn, meets P{Hi) for tj again 
and has a crossing for £2 > U > Hi. Note, that there is no crossing, if the point 
P{Hi) is not met between t\ and t 2 , see Fig. 5(ii). 

W.l.o. g. we assume ip(Hi) = 0. The loop may leave the obstacle or not, 
however, we reach t. t with the heading <p(tj) + (—7) = 2-7T. At P(Hi) the robot 
turns clockwise with angle 7 to follow the obstacle boundary, so —n < 7 < 0 
must hold. Hence <p(U) is greater than 7 r and the obstacle condition is violated. 

Now we look at a clockwise turn. The curve hits an obstacle at Hi, follows 
the obstacle and leaves the obstacle. Eventually, it returns to the obstacle at 
another hit point H k > Hi and has a crossing at t 2 , see Fig. 6(i). The point 
P(Hk) has to be met before at t k between Hi and t\. Otherwise the curve only 
touches itself and there is no crossing at t 2 , see Fig. 6(ii). 

Let (p(H^) denote the heading immediately after the robot has turned in 
H k , thus <p(H^) = ip(Hk) + 7. As above we have — 7r < 7 < 0. On the other 
hand, the curve has made a full clockwise turn between t k and H£, therefore 
<p(H£) = tp(tk ) — 27 r. Finally, the obstacle condition has to be fulfilled, too: 

¥>(*fc) - <p(H k ) < 7T 

tp(H£) + 27 r - ip(Hk) = tp(Hk) + 7 + 27 t - ip(H k ) < 7 r 

<t=> 7 < — 7T 
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It follows that the first crossing cannot exist, and— by induction — the curve 
cannot cross itself. □ 

Lemma 3. A curve C £ K, will hit every edge in the environment at most once. 







Fig. 7. A curve that hits an edge twice. 



Proof. Let us assume, the curve C will hit an edge e more than once: after a first 
hit at Hi the robot moves around and hits e again at H k , see Fig. 7. At P(Hi) 
respectively P(Hk) the robot turns clockwise to follow the edge e, therefore 
— 7 r < 7 i, 7 fc < 0. Let tp{Hf ) and be defined as in the proof of Lemma 2. 

W. l.o. g. we assume = 0. Since the curve in Hf and follows the 

same edge e, the headings and <p{H^) must be mod 27 t the same, i.e., 

<p(H^) = 2kn, k £ 7Z. For k ^ 0, with tp(Hi) = - 7 * and ip(H k ) = 7 *, it 

follows that | <p(Hk) — \ = |2fc7r — "f k + 7 »| > n and the free space condition 

would be violated. 

Therefore, we can assume k = 0 and = 0. Consider the part of C 

between the first and the second visit of P(H k ) (in a situation as shown in 
Fig. 7(i)) respectively the curve between the consecutive visits of P(Hi) (in a 
situation as shown in Fig. 7(ii)). If this loop, i, has no crossings, then £ is a jordan 
curve and C makes a ±27 t turn in l. Thus <p(H^ ) equals ±27 t, in contradiction 
to our assumption. Hence the curve between the two visits of e must have at 
least one crossing. But this contradicts to Lemma 2. □ 

Finally, with Lemma 2 and Lemma 3 we are able to show that the conditions 
from Definition 1 are sufficient to solve our problem. 

Theorem 4. A robot, whose path follows a curve C £ 1C, will escape from an 
unknown maze, if this is possible at all. 

Proof. A curve that meets the conditions from Definition 1 will hit every edge 
in the environment at most once. Therefore the robot will either escape at the 
latest after it has visited all edges, or it will be trapped, because there is no 
escape from the maze. □ 

4 Applications 

Within this section we discuss the practical relevance of Theorem 4. What con- 
sequences does it have for the design of a robot that should be able to leave an 
unknown maze? 
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Fig. 8. A curve and a related robot’s path. 



Still we have to discuss, what it means that the robot follows a curve C G 1C. 
In fact, the deviation from the robot’s path, 77, and C can be as large as shown 
in Fig. 8. The path 77 might be produced by an arbitrary strategy, C shows 
the corresponding curve. The dots on 77 represent the hit points in terms of 
the robot’s strategy, i.e., the strategy switches from a move-through-free-space 
behavior to a go-around-obstacle behavior. Amazingly it is sufficient that the 
hit points met by C and 77 refer to the same edges — including an imaginary 
‘final edge’ which marks the successful escape from the maze. The sequence of 
hit points is a kind of Ariadne’s Thread that leads the robot out of the maze. 
Between the hit points the robot can do whatever it wants, it can even hit some 
other edges. 

More precisely let He be the sequence of edges hit by the curve C, and 
Hn the sequence of edges that are related to a hit point of the robot, then the 
following holds: 

Corollary 5. A robot escapes from an unkown maze, if there exists a curve 
C G 1C, such that He is a subsequence ofHn- 



4.1 A Simple Compass Is Sufficient 

Let us now assume that the robot is equipped with an error afflicted compass. 
Additionally, the robot should be able to follow walls and walk in the free space 
until it detects hit points. Let the robot measure its turning angles by using 
compass values. How this is implemented will depend on the specific robot. 
Note, that just (even exact) compass readings without measuring turning angles 
is not sufficient, because only directions mod 27 r can be detected in this case. 

If the compass has a measuring error less than | it should be easy to ensure 
that the heading of the robot in the free space remains in an intervall of ]— | , | [. 
This guarantees the free space condition at hand. Additionally, at every detected 
hit point Hi we have ^{Hf) G ] — §, f[. 
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On the other side it can be assumed that it is easy to detect 2i r or — 2n turns 
along the walls, although again error afflicted within a range of ]— | | [, due 
to the compass inaccuracy. Therefore we can assume that the total measured 
turning angle along the walls is always within a bound of ]— away from the 
exact turning angle. So while the robot follows a wall being at position t £ C 
we have ip(t) < |. Altogether, the obstacle condition is fulfilled. 

Corollary 6. Let an unknown maze be given and let the robot be equipped with 
an error afflicted compass. Let us assume that it is possible to escape from the 
maze. If the accuracy of the compass is not worse than a simple pledge-like 
strategy leads to an exit of the unknown maze. 

4.2 Pseudo Rectilinear Scenes 

Now we assume that the environment has more or less axis-parallel walls. To be 
more precise we introduce the notion of (a, (3) pseudo-rectilinear polygons and 
specify the robot’s ability with respect to a and (3. 

A convex (concave,) corner of a simple polygon P is a vertex of P with outer 
angle greater (smaller) than n. A simple polygon P is called pseudo-rectilinear 
iff \{p | p is a convex corner of P}| = |{p | p is a concave corner of P}| + 4 . 



Fig. 9. A pseudo-rectilinear polygon with anglediv(u) = a and edgediv(e) = /3. 

In Fig. 9 there is an example of a pseudo-rectilinear polygon. There are several 
ways to measure the quality of rectilinearity of a pseudo-rectilinear polygon. 
Our aim is that the robot should be able to count the turns along obstacles by 
considering the number of passed convex and concave vertices. Therefore it is 
important to detect whether a vertex is convex or not which gives rise to the 
following definitions. 

For a vertex v let Z(u) denote the outer angle at v. We define the angle- 
divergence at a single corner as follows (see Fig. 9): 




v 
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On the other hand the robot should be able to leave an obstacle along a wall 
in ‘horizontal’ or ‘vertical’ direction. For this purpose we have to consider the 
divergence of the edges from the X-axis respectively Y-axis. 

For an edge e = vw let edgediv(e) denote the smallest angle between e and 
an axis-parallel line through v or w (see Fig. 9). For a pseudo-rectilinear polygon 
P we define the edge- divergence edgediv(P) of P by 



A pseudo-rectilinear polygon P is called (a, /3) pseudo-rectilinear iff 
anglediv(P) < a and edgediv(P) < f3 holds. 

We assume that the robot measures angles at corners respectively between 
its heading and a wall within an accuracy of <5, that is, the absolute difference of 
the actual angle and the measured angle is smaller than S. In order to guarantee 
that convex and concave corners are detected correctly it obviously suffices to 
require that a + 5 < f , see Fig. 10. 



Fig. 10. Detecting concave and convex corners under the presence of errors a and S. 
Obviously, a + 5 should not exceed J . 

W. 1. o. g. we assume that the robot’s predefined direction is north, i. e., | in 
the mathematical sense. The correctness of the movement in the free space will 
depend on (3 and <5, we have to require that the next hit point is appropriate. 
First, we require that the robot’s path between a leave point and a hit point 
should be monotone with respect to the leaving direction. On the other hand it 
is important that the robot hits a horizontal edge at the end of its free space 
movement, thus it will be able to go on counting correctly. Note, that between 
two hit points the robot can hit and detect vertical walls. In this case we let the 
robot simply slide along them. 

We want to make sure that the robot is able to detect whether an edge is 
‘horizontal’ or ‘vertical’. Naturally, if the measured angle between the robot’s 
heading and an edge is smaller than t and greater than f, it is reasonable that 
the robot assumes that the edge is horizontal. 



edgediv(P) := maxedgediv(e) . 
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In this sense it suffices to guarantee that the robots heading in the free 
space lies in the range ] \ + (3 + 5, |7r — (3 — $[. For example in Fig. 11 the robot 
leaves the obstacle at point p and meets edge e with heading <j>. Since e should 
be detected as ‘vertical’ the angle 7 — S should be greater than | 7 r. We have 
7 = 7T— (jt — (| — (3) — (ft) and therefore we require (7— <5) = ^ — (/3+5)+(j) > |7r 
which in turn is equivalent to </> > j + /3 + 6 . With similar arguments we get the 
upper bound of the range ]j+/3 + 5, — (3 — <5 [. 




Fig. 11. Detecting ‘horizontal’ and ‘vertical’ edges under the presence of errors (3 and 8. 



We assume that the robot counts convex and concave corners rather than 
counting angles at obstacles. With respect to the leaving direction at a leave 
point the robot always can start with a heading in the range [f — (3, f + /3] ■ 
Altogether, according to the range above, it suffices to require that the robot 
should be able to guarantee this heading up to an angle ^ — <5 — 2/3 > 0. Obviously, 
2/3 > a holds. Therefore the condition a+S < § is already fulfilled, if ^—<5—2/3 > 
0 holds. 

Corollary 7. Let an unknown maze with a set of (a, /3) pseudo-rectilinear poly- 
gons be given. Let the robot be able to measure angles within an accuracy of S. 
Let us assume that it is possible to escape from the maze. 

If the robot is able to guarantee its heading in the free space up to an angle 
f — S — 2/3, a simple pledge-like strategy lead to an exit of the unknown maze. 



4.3 Exact Free Motion 

Now let us assume that the robot is able to move a straight path between obsta- 
cles, just its angle counter is inaccurate in some way. Let /3,; denote the difference 
between the real angle and the measured angle at the itli turn and n the number 
of vertices in the environment. 



Lemma 8. If the robot is able to move a straight path in the free space and 

e 



that 



E A 

i=k 



< 7 r holds for all k < I < m, where m denotes the number 
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of turns the robot has made so far , then it will be able to escape from an unknown 
maze. 

Proof. The new condition implies that the absolute value of the measuring error 
that is the difference between the robot’s heading and its angle counter, never 
exceeds n. Now we have to show that our new condition does not violate the 
conditions from Definition 1. 

First, let us assume that the free space condition is not met, so there must 
be two points ti,t 2 in the free space where \<p{t\) — c^(i 2 ) | > ^ holds. W. 1. o. g. 
we define <p(t\) = 0. Since the robot moves a straight line in the free space, the 
headings at the leave point, the following hit point and all points between them 
must be the same. So there must be a leave point L k with |<^(Lfc)| = |^(t 2 )| > 7r. 
But the robot leaves an obstacle only, if its angle counter has reached zero, so 
the absolute value of the measuring error in L/ ; is at least 7 r. 

Second we assume, the obstacle condition would be violated. Then there must 
be an obstacle Pi with a hit point H k and another point tk with P(tk) = P{Hk), 
such that ip(tk) — <p(Hk) > tt holds. Let w.l.o. g. (p(Hk ) = 0, then ip(tk) > ^ r 
holds, but the angle counter can’t be greater then zero, since the robot leaves 
the obstacle as soon as the counter becomes zero. So the difference between the 
robot’s heading and its angle counter must be at least n. □ 

Now, let /?max := max /?*. With Lemma 3 it is easy to see that the robot visits 
at most n 2 vertices, and we have the following result: 

Corollary 9. A robot that guarantees |/3 max | < \ is able to escape. 

5 Conclusion 

We considered the pledge algorithm under errors in sensors and motion and 
established sufficient requirements for the robot for leaving an unknown maze. 
Especially, we showed, that a robot equipped with a simple compass will fulfill 
this task, we gave upper bounds for errors in pseudo rectilinear mazes and for 
robots with correct motion. Furthermore, in Corollary 5 we gave a framework 
for proving the correctness of arbitrary strategies for leaving mazes. 

Within a simulation environment we have incorporated several sources of 
errors into an implementation of the pledge algorithm, see [6]. Next, we are 
going to implement the pledge algorithm for a Pioneer 2 AT Robot and will 
further observe its behavior on pseudo rectilinear mazes and other error afflicted 
settings. We are quite sure that on smooth terrains our robot will be able to 
fulfill the required conditions. 
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Abstract. We study the online matching problem when the metric 
space is a single straight line. For this case, the offline matching problem 
is trivial but the online problem has been open and the best known com- 
petitive ratio was the trivial 0(n) where n is the number of requests. 
It was conjectured that the generalized Work Function Algorithm has 
constant competitive ratio for this problem. We show that it is in fact 
l?(logn) and O(n), and make some progress towards proving a better 
upper bound by establishing some structural properties of the solutions. 
Our technique for the upper bound doesn’t use a potential function but 
it reallocates the online cost in a way that the comparison with the offline 
cost becomes more direct. 



1 Introduction 

We study the online version of the weighted bipartite matching problem in which 
the nodes of one partition, called the servers, are known in advance and the nodes 
of the other partition, the requests, arrive online and must be matched immedi- 
ately. The objective is to minimize the cost of the matching. The more general 
problem in which the nodes of both partitions arrive online and must be matched 
as soon as possible is not essentially different; by appropriate partitioning the 
sequence of points, it is simply a repeated version of the simple version and all 
our results apply to the more general problem. Of special interest is the metric 
matching problem in which the points, both servers and requests, lie in some 
metric space and especially in a Euclidean space. 

The offline matching problem is one of the most fundamental algorithmic 
problems and it has played a central role in the development of the theory of 
algorithms [2,3,10]. The online version of the problem has also been studied 
before. 

Karp, U. Vazirani, and V. Vazirani [8] were the first to study some online 
version of the matching problem. They studied randomized algorithms for online 
matching in an unweighted bipartite graph when the optimum matching was a 
perfect matching. Khuller, Mitchell, and Vazirani [9] studied the online version 
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of minimum bipartite matching. They showed that for non-metric spaces the 
competitive ratio is unbounded. They also proposed a natural algorithm, Per- 
mutation, which has optimal competitive ratio for arbitrary metric spaces; its 
competitive ratio is 2 n — 1, where n is the number of requests. The same algo- 
rithm was studied independently by Kalyanasundaram and Pruhs [5,7]. They 
conjectured that the Work Function Algorithm has constant competitive ratio 
(see their review article [6]). 

Unlike the minimization version of the online matching the maximization 
version has been completely settled. For non-metric spaces the competitive ratio 
is unbounded [9] and for metric spaces the greedy algorithm is optimal and has 
competitive ratio 3 [5]. 

Here we are concerned with the special case of the metric online minimum 
matching where the servers and requests lie on a line (1-dimensional Euclidean 
space). This is perhaps the most interesting case of online matching for many 
reasons. First, the offline matching problem for this case is trivial and therefore 
competitive analysis appropriately captures the deterioration of performance 
(that is, the approximation ratio or competitive ratio) due to lack of complete 
information. Second, it is a natural generalization of other fundamental online 
problems such as the cow-patlr problem (also known as the bridge problem) 
[1]. Third, versions of online matching problem play a central role in electronic 
markets where “buyers” and “sellers” arrive online, name their prices, and must 
be “matched” . One can view the prices as points on a line that must be matched 
online. The version of the matching problem that we study in this work together 
with the analogous maximization version, are the simplest, pure abstractions of 
these online matching problems. 

It is easy to show that no online algorithm can have competitive ratio less 
than 9 since the online matching problem is a generalization of the cow-patlr 
problem (see the beginning of the Section 3). Very recently Fuchs, Hoclrstattler, 
and Kern [4] gave a lower bound of 9.001 which suggests that the problem 
is not simply a disguised generalization of the cow-path problem. It has been 
conjectured [6] that the matching problem on the line and in particular the Work 
Function Algorithm (WFA) have constant competitive ratio. Here we disprove 
the conjecture and establish a lower bound of i?(log n) where n is the number of 
requests. We also show an upper bound 0(n) for the competitive ratio of WFA. 
The bound by itself is weak but the proof technique is more important. It exhibits 
structural properties of the solutions that can lead to improved bounds. Using 
these properties we show that the competitive ratio is no more that the height of 
the online matching (i.e., the maximum number of lines through a point). We can 
also establish a tight upper bound (0(log?r)) for some special cases (restricted 
adversaries), but due to lack of space we omit these results from this abstract. 

Outline 

In Section 2 we define the WFA and prove some properties that are useful in 
general and in particular they simplify the argument for the lower bound. We 
then give the relatively simple lower bound (Section 3). 
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To prove the upper bound, we first establish some useful properties and 
invariants of the WFA (beginning of Section 4). We now want to compare the 
cost of the WFA to the optimal matching (opt). However, the optimal matching 
can use different set of servers than WFA. We show (Subsection 4.1) that this 
is not a serious problem: the WFA uses an almost optimal set of servers in the 
sense that the optimal offline matching involving the servers of WFA ( popt ) 
is within a constant factor from the optimal (opt). We need therefore only to 
compare the online cost with popt. We don’t know how to do it directly though. 
As it is frequently the case, it is not easy to apply the potential function method 
to online algorithms with non-constant competitive ratio. Indeed, we don’t use a 
potential function but we find a way to reallocate the online cost in such a way 
that it becomes easier to compare it to popt. A major obstacle is that popt is not 
monotone: more requests can decrease the matching. Furthermore, new requests 
may decrease the number of popt lines that cross a particular point (see f3 x in the 
Section 2). We get around these problems by introducing a monotone quantity, 
the alive parts of the offline matchings (alive), which counts each interval with 
multiplicity roughly equal to the maximum number of popt lines that crossed it 
in the past. We then show that alive is also within a constant factor from popt 
(Subsection 4.2). Finally, it is easy to bound the online cost by alive, and thus 
indirectly with the optimum (Subsection 4.3). 

We conclude and mention some open problem in Section 5. 

2 Properties of the Work Function Algorithm 

The metric online matching problem is defined on a metric space At endowed 
with distance d. A (possibly infinite) subset of S C At is the set of servers. 
Some of these servers are going to be matched with a sequence of requests 
(ri,r 2 , . . .} C At 1 . Let R n = (n, . . . , r n } denote the set of the first n requests 
and let S n = {s ri , ... ,s r } denote the set of servers used by the online algorithm 
to match these requests. 

For the problem we study here the metric space is the 1-dimensional Eu- 
clidean space. Let M(A,R) denote the optimal (minimum) matching between 
sets A and R with |A| = |i?|. Clearly M(A,R) can be obtained by matching the 
leftmost request to the leftmost server and recursing in the same fashion. There 
may be other optimal matchings but we will use the notation M(A,R) for this 
particular matching. For simplicity, we also denote the weight of the optimal 
matching by M(A,R). 

Given the definition of the matching M(A, R) we define 

/3 X (A, R) = \{a : a £ A and a < x}\ — |{r : r £ R and r < x}\, 

which is the number of lines in M(A,R) that cross point x with appropriate 
sign. 

1 Here we assume that all servers and requests are distinct points. It is obvious that 
the general case of allowing multiple servers and/or requests on the same point is 
the limit case when some distances tend to zero. For simplicity we sometimes allow 
multiple servers/requests on points though. Also, all sets are assumed to be multisets. 
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We denote singletons {a} by their unique element a, and the multiset con- 
sisting of k copies of a by a k . In particular, B + a k will denote the multiset that 
results if we add k copies of a to multiset B. 

Definition 1 (Work Function Algorithm). The Generalized Work Function 
Algorithm (7WFAJ matches request r n to an unmatched server s r £ S — S n - 1 
that minimizes the expression: 'yM(S n -i + s r , R n -i + r n ) + d(s r , r n ). 

We state the following easy result without proof. 

Proposition 1. For 7 = 0 this is the greedy algorithm with competitive ratio at 
least 2” — 1, and in general, for 0 < 7 < 1, competitive ratio is exponential, at 
least (£" — 1) / (C — 1) > where £ = 2/(7 + 1). The case 7 = 1 is known simply as 
the Work Function Algorithm, with competitive ratio at least n. The case 7 = 00 
(i.e. ignore d(x,r) and choose s r that minimizes M(S n - 1 + s r ,R n - 1 + r n )) 
is the retrospective algorithm also known as the Permutation algorithm [5] with 
competitive ratio at least n (and at most 2n — 1). 

Our analysis applies to any 7 > 1. There are indications that the minimum 
competitive ratio is achieved for approximately 7 = 3; in particular for the 
special case of the cow-path problem the value 7 = 3 is optimal. We decided to 
state our result for general 7 instead of 7 = 3 because this slight generalization is 
sometimes helpful to the reader (by reducing the number of “magic constants”). 

Let r be a request and let si be the rightmost unmatched server in the 
interval (— 00, r). Similarly let s 2 be the leftmost unmatched server in (r, 00). 
We call the two servers Si and S2 the surrounding servers of r. It is easy to see 
that any online algorithm can be converted into one that services each request 
with one of its two surrounding servers with the same (or better) competitive 
ratio. Our first aim is to show that the 7WFA has this nice property: 

Property 1 (Locality). The 7WFA services each request with one of its two sur- 
rounding servers. 

To show the Locality Property we need to take into account the history of 
the matching created by the 7WFA. To see this consider the following example: 
There are three servers Si,S2,S3 (Figure 1). 

O X O O X 



Fig. 1. Surrounding servers 



The first request 77 is in the interval (si, S2) and is matched to S2. The second 
request is in (S3, 00). If 7WFA has Property 1, then it must match r 2 to S3. This 
can be easily verified, but only if we take into account the history of 7WFA and 
in particular the fact that d(s 1, 77) > d(r 1, S2). This example shows that we need 
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some kind of induction to show that 7WFA has the Locality Property. It turns 
out that the Locality Property follows from the following invariant (in fact, it is 
equivalent to it). 

Theorem 1. Let si , S 2 be two servers that, before time t, have not been matched 
by the 7WFA algorithm but every other server in (si,S 2 ) has been matched. 
Let A, R be the sets of servers and requests before time t in (s\,s 2 ). Then the 
7WFA has matched the set of requests R to the same- cardinality set of servers 
A; furthermore 



7 M(A, R) < 7 M{A + si , R + S 2 ) + d(s 1 , S 2 )' (1) 

Before we proceed to prove the theorem, we point out that the Locality 
Property before time t follows immediately from it and in particular from the 
conclusion that A and R have the same cardinality and are matched by 7WFA. 
Conversely, the theorem follows if we assume that 7WFA has the Locality Prop- 
erty at time t: let the request r at time t be to the right of S2 within very small 
distance; then the 7WFA will prefer to match r to S2 than to Si and this 
gives (1). 

We will also need the following lemma about matchings: 

Lemma 1. Let si, S2 be two servers and let A and R be two sets of points 
(servers and requests) in (si,S 2 ) of equal cardinality. Let also r 1 < r 2 be two 
points (requests) in [si,S2]- Then 

M(A+si, R+ri)+M(A+si,R+r 2 ) Af(A+si+si, R-\-ri~\-r 2 ')-\-M(A, R). ( 2 ) 

Proof. Fix a point x £ [si, J7). The total number of lines that include x of the 
left-hand side is |/3 x (A + si,i?-|-r , i)| + \(3 X (A + s\, R + r 2 )\ = 2| (3 X (A, R) + 1| and 
of the right-hand side is \(3 X (A + si + Si, R + 77 + r2)| + \0 X (A, i?)| = \(3 x (A,r) + 
2 | + \(3 X {A, r)|. Since 2 |a + 1 | < |a + 2 | + |a| (the absolute function is concave), 
it follows that the contribution of x to the left hand side of the inequality is no 
more than the right hand side. A similar situation holds when x £ [ri,r 2 ) and 
x £ [r 2, S2) and the lemma follows. □ 

Proof (of Theorem 1 ). To prove Theorem 1 we use induction on the number of 
requests in R and the properties of the 7WFA. Let r be the most recent request 
in R and let s be the server matched to r by the 7WFA. By induction, the 
Locality Property holds and therefore s is in the interval (si, S2), that is s £ A. 
We consider two cases. 

Case 1: r < s. Let A\ and A 2 be the set of servers in (si,s) and (s,s 2), 
respectively. Let also R\ and R 2 be the associated requests in these intervals 
(requests before r). By induction, A\ (resp. A 2 ) has the same cardinality with 
Ri (resp. R 2 ). 

By the definition of 7WFA we have 7 M (A\ + s, R\ + r) + d(s, r) < 7 M (A\ + 
si, R\ + r) + d(si,r). By the induction hypothesis we also have r yM(A 2 , R 2 ) < 
r )M( y A 2 + s,R 2 + s 2 ) + d(s, s 2 ). Therefore: 
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7 M(A, R) = + s, R\ + r) + 7 M(A 2 , R2) 

< [yM(Ai + si, Ri + r) + d(si,r) — d(s, r)] 

+[ r yM(A 2 + s, i?2 + ^2) + d(s, S2)] 

= 7 M (Ai + A 2 + si + s, Ri + R 2 + r + s 2 ) 

+ [d(si,r) - d(s, r) + d(s, s 2 )] 

< , yM(Ai + A 2 + si + s, R\ + R 2 + r + S2) + d(si, s 2 ) 

= 7 AT(A + si, R + S2) + d(st, s 2 ) 

Case 2 : r > s. Let again A 1: R 1 and A 2 ,R 2 be as in the first case. Then 
M (A, R) = M ( A\ + A 2 + s, Ri + R 2 + r) = AT (Ai , Ri) + AT (A 2 + s, R 2 + r). 

By induction on the left interval, we have r )M{A \ 1 Ri) < jM(Ai + s\, R\ + 
s) + d(si,s). Also by induction on the right interval, we have r yM(A 2 , R 2 ) < 
r yM{A - 2 + s,R 2 + S2) + d(s, s 2 ) or equivalently 

7 AT(A 2 , R 2 ) — 7 M(A 2 + s, R 2 + s 2 ) < d(s, s 2 ). 

But by Lemma 1 we have 

M(A 2 + s, R 2 + r) — M(A 2 + s + s, R 2 + r + s 2 ) < M(A 2 l R 2 ) — M(A 2 -\-s,R 2 -{-s 2 ). 

Combining the two inequalities we get 7 AT ( A 2 + s,R 2 + r ) — 7 AT ( A 2 + s + s, R 2 + 
r + s 2 ) < d(s, s 2 ). Therefore: 

7 AT (A, R) = 7 AT {Ai , Ri) + 7 AT (A 2 + s, R 2 + r) 

< [7 AT (Ai + si , Ri + s) + d(si, s)] 

+[ r yM(A 2 + s + s, R 2 + r + S2) + d(s, $2)] 

= r yM(Ai + A2 + s + si, Ri + R 2 + r + S2) + d(si, S2) 

= 7A T(A + si, R + S2) + d(si, S2) 

Notice the crucial use of Lemma 1 . We remark that it we could get away 
without it when A\ ^ 0 , but it is absolutely necessary otherwise. This is because 
in that case IA2 + s| = \A\ and so we cannot apply induction. 

We also remark that in the second case we used only induction, no property 
of the 7WFA at all. □ 

The following invariant is a generalization of Theorem 1 and we state it 
without proof: 

Theorem 2 . Let s 2 be as in Theorem 1 . Then for any points iR , ... . . , Vk with 
si < vi < v 2 < ■ ■ ■ < v k < s 2 we have 

7 AT (A+sJ , R+i>i + - • '+Ufc) < 7 A T (A+s^ 1 1 R+rq + • • •+u/ s +S2)+d(si, S2) , ( 3 ) 
where s™ denotes m copies of s\. 

Definition 2 (Balanced Interval). Let s\,s 2 be two unmatched servers such 
that inside (si, S2) all servers are already matched. We will call such an interval 
balanced. 

Because of the Locality Property, we can focus our analysis to a balanced 
interval. 
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3 Lower Bound 

We remark that the cow-path problem is a special case of the matching problem 
and this implies a lower bound of 9 for any online algorithm. In the reduction 
to the cow-path problem, there is a server on each integral point of R except 0. 
The first request is at n = 0, and request ?y, t > 1, is at s r _ 1 , the server used 
to service the previous request. 

We will now show a lower bound of i?(logn) on the competitive ratio of 
qWFA. This disproves a conjecture of Kalyanasundaram and Pruhs. We take 
advantage of the Locality Property of yWFA to simplify the argument. 

Theorem 3. The competitive ratio of yWFA is at least 2 + [lognj, where n is 
the number of requests. 

Proof. Without loss of generality n is a power of 2. The set of servers is defined 
as follows: There is one server at 0, one server at n and 2 servers at each of the 
even integers in (0,n). The requests come in stages. In the first stage all odd 
points in (0, n) are requested. Consider the request at 2 j + 1. Using the Locality 
Property, we see that the qWFA can service it by either a server on the left, 
at 2 j, or a server at the right, at 2 j + 2, because they are at equal distance 
from the request. We want to allow the adversary to break the tie. This can be 
enforced by perturbing the request slightly. In particular, we service all requests 
with the servers at positions 2 + 4 k, as shown in Figure 2. Notice now that the 
remaining servers are in a configuration similar to the initial one, only now they 
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Fig. 2. Stage 1 (x denotes request and o denotes server) 




Fig. 3. Stage 2 



are placed apart at distance 4 instead of 2. So we can repeat the construction. 
In particular, in the second stage all points 2 + 4 k are requested and are serviced 
by the servers at points 4 + 8 A;. We repeat it until only the server at 0 remains 
(in the last stage, a request is placed at n/2 which is serviced by the server at 
n). At the end, just to finish the construction we place a request at n and it is 
serviced by the only remaining server at 0. 

The online cost at each stage is n/2 and the cost of the last request is n. 
There are logn stages, so the total online cost is n + ^ logn. The total offline 
cost is easily seen to be n/2. The competitive ratio is 2 + logn. □ 
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4 Upper Bound 

We define the set of points crossed exactly j times by the optimal matching 
M(A,R ) as 

B>(A,R) = {x : /3 x (A,R)=j}. 

We also use the notation B~{A 1 R) = Uj<o-B J ' {A, R) and B + (A,R) = U />o 
Bi(A,R). To keep the expressions simple, we use the same symbols for the 
measure of these sets. Given these definitions, observe that 

M(A,R) = J2\j\-B\A,R). ( 4 ) 

3 

Thus for any balanced interval (si, S2) which includes the set of servers A and 
set of requests R , we have M(A+si, R+s 2 ) = M(A, R) + B + (A, R) + B°(A , R) — 
B~(A , R) and d(si, s 2) = B + (A, R) + B°(A , R) + B~ (A , R). Substituting these 
values in Theorem 1, we have: 

2y B~(A, R) < (7 + 1) d(sl, s2), and 
2 ^B + (A,R) < (7+ l)d(sl,s2). 

Now we are ready to prove a generalization that will be very useful later. 

Lemma 2 . Let (si,S2) be a 7WFA balanced interval which includes the set of 
servers A and set of requests R. For any x € (si,S2) 

2y || B~(A, R) n [si, x]|| < (7 + I) d(si,x), and 
27 1| B + (A 1 R) n [x, s 2 ]|| < (7 + 1) d(x,s 2 ). 

Proof. The intuition behind the lemma is that since si is an unmatched server, 
it was rejected in favor of other servers in the interval (si,S2)- This in turn 
indicates that || B~(A,R) fl [si,x]|| cannot be very large. 

We prove only the first part, the second is similar. Suppose that the statement 
was true before the last request, r t £ (si,^), was matched to a server s r € 
(si, S2)- Let A t - 1 = A — s r and R t -i = R — r t . We consider two cases depending 
on whether s r is to the left of rt or not. The first case when s r < rt is trivial 
because such a request increases the values (3 x (A t -\, Rt-\) ■ 

The other case, rt < s r , is more involved. We first notice that when x is 
in (s!,r t ) then the lemma holds directly by induction. Also if the lemma holds 
for x = s r then it immediately holds for any x £ [s r , s 2 ) from the induction 
hypothesis for the balanced interval (s r ,s 2 ). 

Therefore we concentrate on the case x £ [77, s r ]. To show that the lemma 
holds for x = s r we don’t need the induction hypothesis, only the fact that 
7WFA services r t with s r instead of sp 



jM(A t -i + si, Rt-i + r t ) + d(si,rt ) > 'yM(A t -i + s r , R t - 1 + r t ) + d(r t , s r ). 
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With some work we can rewrite it as 

27 \\B~{A, R) n [si,s r ]|| < (7 + 1) d{si,s r ) -2 d(r t ,s r ), (5) 

which is slightly stronger than the lemma when x = s r . 

Finally, for the case x £ [r t , s r ), we use ( 5 ) and the second inequality of the 
inductive hypothesis for the interval (x,s r ). Specifically, by induction on this 
interval, we get 2 r y\\B + {A t -i,Rt-i) IT [x,s r ]|| < (7 + 1 ) d(x, s r ), which can be 
rewritten as 

-2"/ \\B~ (A, R) n [x,s r ]|| < -(7- 1 )d(x,s r ) (6) 

Summing ( 5 ) and (6) we get 

27 || B~(A,R) IT [si,a:]|| < (7 + l)d(si, x) - 2d(r t ,x) 
and the lemma follows. □ 



4.1 Optimal vs Pseudo-Optimal Matching 

Suppose that after n requests, 7WFA has matched R n to servers S n . The (offline) 
optimal matching though may match R n to some other set S' n . We call the 
optimal matching M ( S n , R n ) the pseudo-optimal matching and denote its weight 
by popt n to distinguish it from the optimal matching M(S' n ,R n ) whose weight 
is opt n . Although 7WFA cannot use the optimal set of servers (in general S n ^ 
S' n ), it has the nice property that it uses an almost optimal set of servers. 



Theorem 4. For any set of servers and requests, 



opt-n < poptn < 



7+1 

7-1 



Opt n 



We now proceed to prove this theorem with the help of Lemma 2 . To prove 
Theorem 4 , we assume without loss of generality that all requests are in a bal- 
anced interval (si, S2), otherwise we sum up the parts for each balanced interval. 
We need to compare popt n = M(S n , R n ) and opt = M(S ' n , R n ), for any S' n such 
that |S'( t | = | S n j = | 1 ?|. We can obtain the matching M(S' n ,R n ) by putting re- 
quests at server points in S n — S' n and matching them to servers in S' n \ S n : 
M (S' n , R n ) = M (S' n U S n , R n + S n — S' n ). Observe now that in the worst case the 
servers in S' n that are outside the interval (si,S2) are at si or S2- Thus, instead 
of proving Theorem 4 , we shall prove the following stronger lemma. 



Lemma 3. Let (si,S2) be a 7WFA balanced interval that contains the set of 
servers A which is matched to requests R. For any points Vi, . . . , Vk+ m in (si, S2) 



Ad (A + + S™ , R + Vi + V 2 + • • • + Vk+m) ^ 
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Proof. Assume v\ < ■ ■ ■ < Vk+ m and denote i>o = si. We concentrate in showing 
the special case with m = 0 and then observe that the same proof applies 
to the general case. Define Cf = || {a: : /3 X (A,R) = j} fl [i>i_i,t 7 ]|| to be the 
measure of points between and which the optimal matching crosses j 
times (with appropriate sign). Applying Lemma 2 to interval (si,i> t ) we obtain 
£f Ei=i Ej< o C j < Eli Ei>o C 1 and summing for t = 1, . . . , k, we get: 

-j k t k t 

^£££c/<£££c/, 

' £= 1 i= l j < 0 t=l 2=1 j>0 

which can be rewritten as 

£ry £ £( fc + 1 - *)Ci < £ £(* + 1 ^ OCi • 

J 2=1 j <0 2=1 j >0 

Using this and the assumption 7 > 1, we can bound 



A/(A, i?) — M(A + 5 ^, R + vi -\ , 1 ;^) 

k k 

= £D iicf-EE |j + fe + 1 - i|C- 

*=1 j i=l i 



< 



2 

7+1 



££uic? 

2=1 j <0 



The last quantity is at most equal to AL^M(A, R) and the lemma follows 
(for m = 0) . When m > 0 the proof is very similar but now the right-hand side 
includes also the terms with j > 0; this is still bounded by A^M(A, R) and the 
lemma holds. □ 



This theorem implies almost immediately that 7 WFA is 0(n)-competitive 
as follows: It is not hard using the f3 x values to verify that d(r n , s r ) < M(S n - 1 , 
Rn-i) + M(S n ,Rn ) which in turn implies that the cost to service request r n is 
bounded above by popt n _i + popt n < ^zj(opt n - 1 + opt n ). Since opt n is non- 
decreasing function (which by the way is not true for popt n ) the claim follows. 
We however show a stronger result and in the process we will discover some 
important properties that may be useful to improve the competitive ratio. We 
will show that the competitive ratio is bounded (up to a constant factor) by the 
maximum number of crossing lines in the 7 WFA matching. 



4.2 Alive vs Pseudo-Optimal 

In this section, we take a closer look at the structure of the pseudo-optimal 
matching, and use this analysis to derive an 0(n) bound on the competitive 
ratio of yWFA. 
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We say that a point x contributes to popt n at time t if 



0 < (3 x {St- 1, 



Rt-i) <Px(S u Rt) <(3 x (S t+1 ,R t+l ),...,(3 x (S n ,R n ) 



or 

0 > P x (S t -i,Rt-i) > j3 x {S tl Rt) > 0x(St+ i ) Rt-\- 1) ) ■ ■ ■ ; Px(Rn i Rn ) 

We say that a point x contributes to alive n at time t if the same inequalities 
hold but not necessarily the leftmost ones (involving 0). For example, in Figure 
4, x contributes to alive g at times 3,6,7, and 8 (which correspond to the bold 
lines; intuitively these are the lines that are visible from the point (oo, 0). In the 
same figure, x contributes to popt s at times 7 and 8 (which correspond to the 
bold lines after the last visit to the horizontal axis). 

































x 2 







B x (S t R t ) 




Fig. 4. aliven and popt„ 



We say that a point contributes to alive n (or opt n ) k times when it con- 
tributes to alive n (or opt n ) at exactly k distinct times. For example, the point 
x above contributes 4 times to alives and 2 times to popts. Clearly popt n is 
the measure of all points, each taken with multiplicity equal to the number of 
times that it contributes to popt n . Similarly, alive n is the measure of all points, 
each taken with multiplicity equal to the number of times that it contributes to 
alive n . 

Theorem 5. 

y — 1 

— — alive n < popt n < alive n . 

2q 

Proof. The second inequality is obvious from the definitions. The first inequality 
is based on Lemma 2. Consider a request ry which is serviced by request s r and 
assume without loss of generality that s r < r t . The points that contribute to 
alive n at time t form an interval (s r , q) for some point q. By Lemma 2 at least 
a fraction of this interval has /3 x (A t _ 1 , Rt-i) > 0. This fraction contributes 
also to popt n at time t. □ 
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4.3 Alive vs Cost 

We can now put everything together to obtain the desired upper bound. We 
have considered the four quantities: opt n , popt n , alive n and cost n (the last one 
is the weight of the online matching). Theorems 4 and 5 establish that the first 
three quantities are almost equal within some constant factors: 

27 7 + 1 

optn < poptn < alive n < -optn 

7 — 1 7 — 1 

It is remarkable though that opt n and alive n are non-decreasing while popt n is 
not. Furthermore, the cost to service request r t is at most equal to alivet (each 
point in the interval (ry, s r ) contributes to alivet). Therefore we can bound the 
total costn by 



n 

cost n < alivet < n ■ alive n < 
t=i 



2q 7+I 
7 — 1 7 — 1 



• n ■ optn 



We showed 

Theorem 6. The 7WFA has competitive ratio at most 737733; ■ n = O(n), 
where n is the number of requests. In particular, when each point x is crossed by 
at most k lines of the matching 0/7WFA, the competitive ratio is O(k). 



5 Conclusions and Open Problems 

For the online matching problem on a line, we showed that the WFA has com- 
petitive ratio between !?(logn) and O(n). We believe that the lower bound is 
tight and that the tools we developed here can be very useful. We don’t know 
whether there exists some other algorithm which has constant competitive ratio. 

One important property is the relation between popt and opt which states 
that although WFA may not use the optimal subset of servers — the one that 
minimizes the matching with the requests — it uses an almost optimal one (up 
to a constant factor). This may be true for arbitrary metric spaces. 

For the Euclidean space i? , the lower bound for the cow-path problem, 
h?(n 1-1 / d ), can be extended to the matching problem. Is this tight and what is 
the competitive ratio of WFA? Finally, randomized algorithms may have much 
better competitive ratio; the best randomized lower bound for arbitrary metric 
spaces is only !?(logn) [6]. 
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Abstract. In the classical whack-a-mole game moles that pop up at 
certain locations must be whacked by means of a hammer before they 
go under ground again. The goal is to maximize the number of moles 
caught. This problem can be formulated as an online optimization prob- 
lem: Requests (moles) appear over time at points in a metric space and 
must be served (whacked) by a server (hammer) before their deadlines 
(i.e., before they disappear). An online algorithm learns each request 
only at its release time and must base its decisions on incomplete infor- 
mation. We study the online whack-a-mole problem (wham) on the real 
line and on the uniform metric space. While on the line no deterministic 
algorithm can achieve a constant competitive ratio, we provide compet- 
itive algorithms for the uniform metric space. Our online investigations 
are complemented by complexity results for the offline problem. 

1 Introduction 

In the popular whack-a-mole game moles pop up at certain holes from under 
ground and, after some time, disappear again. The player is equipped with a 
hammer and her goal is to hit as many moles as possible while they are above 
the ground. Clearly, from the viewpoint of the player this game is an online 
optimization problem since the times and positions where moles will peek out 
are not known in advance. She has to decide without knowledge of the future 
which of the currently visible moles to whack and how to move the hammer into 
a “promising” position. What is a good strategy for whacking moles (if there 
exists any)? How much better could one perform if one had magical powers 
and knew in advance where moles will show up? How hard is it to compute an 
optimal solution offline? In this paper we investigate all of the above questions. 

The whack-a-mole problem with popup duration T > 0 (wham^) can be 
formulated in mathematical terms as follows: We are given a metric space M = 
(X,d) with a distinguished origin 0 £ X and a sequence a = (rq, . . . , r TO ) of 
requests. A server (hammer) moves on the metric space M at unit speed. It 
starts in the origin at time 0. Each request rj = ( tj,pj,rij ) specifies a release 

* Supported by the DFG Research Center “Mathematics for key technologies” 
(FZT 86) in Berlin. 

** Research supported by the German Science Foundation (DFG, grant Gr 883/10). 
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time tj and a point (hole) pj £ X where rij moles pop up. A request rj is served 
if the server reaches the point pj in the time interval [tj , tj + T\. We will refer 
to tj + T as the deadline of the request. The goal is to whack as many moles as 
possible. If no confusion can occur we write only wham instead of WHAMy. 

An online algorithm learns about the existence of a request only at its release 
time. We evaluate the quality of online algorithms by competitive analysis [5], 
which has become a standard yardstick to measure the performance. An algo- 
rithm ALG for wham is called c-competitive if for any instance the number of 
moles whacked by ALG is at least 1/c times the number of moles caught by 
an optimal offline algorithm OPT. The first two of the questions raised above 
amount to asking which competitive ratios are achievable by online algorithms. 

In this paper we mainly study the wham on two different metric spaces: the 
line and the uniform metric space. The line is the classical setup for the whack- 
a-mole game. The motivation for studying the uniform metric space (a complete 
graph with unit edge weights) is that “in practice” it does barely matter between 
which points the hammer is moved: the main issue is whether the hammer is 
moved (and to which point) or whether it remains at its current location. 

Related work. The wham falls into the class of online dial-a-ride problems. In 
an online dial-a-ride problem objects must be transported between points in 
a metric space by a server of limited capacity. Transportation requests arrive 
online, specifying the objects to be transported and the corresponding source and 
destination. If for each request its source and destination coincide, the resulting 
problem is usually referred to as the online traveling salesman problem (OlTsp). 
The wham is the OlTsp with the objective to maximize the (weighted) number 
of requests served before their deadlines. 

The OlTsp has been studied for the objectives of minimizing the makespan 
[1, 2, 7], the weighted sum of completion times [7, 13], and the maximum/ average 
flow time [9, 14] . Since dial-a-ride problems (where sources and destinations 
need not coincide) can be viewed as generalizations of scheduling problems (see 
e.g. [1]), lower bounds for scheduling problems carry over. In [3], Barualr et al. 
show that no deterministic algorithm can achieve a constant competitive ra- 
tio for the scheduling problem of maximizing the number of completed jobs. 
Kalyanasundaram and Pruhs [11] show that for every instance at least one of 
two deterministic algorithms is constant competitive, and thus they provide a 
randomized algorithm which is constant competitive. However, it is not clear 
whether and how their results carry over to the more general class of dial-a-ride 
problems. 

The wham has also been investigated in [10] under the name “dynamic 
traveling repair problem” . The authors give two deterministic algorithms for 
the WHAMy in general metric spaces with competitive ratios that are formu- 
lated in terms of the diameter of the metric space. Their ratios translated into 
the notation used in this paper and restricted to the uniform metric space are 
and 4 TjPy ^ + l^j , respectively. We improve these results in several 

ways for the case of the uniform metric space. For instance, for popup duration 
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Table 1. Competitiveness results for the WHAMy on the uniform metric space where 
each mole stays T units of time above ground. Here, N denotes the maximum number 
of moles (possibly stemming from different requests) that are peeking out of the same 
hole, simultaneously, for a positive amount of time. 



Popup duration 


upper bound 


lower bound 


T > 2 


ri T / 2 J + t \ r 51 

LT/2J €h3 ’ 5J 

(see Figure 2) 


2 


1 < T < 2 


2N 


2 


T = 1 


2 (for all tj integral) 

2N (for general tj) 


2 (for all tj integral) 

2N (for general tj) 



T — 2, our algorithm iwtm achieves a competitive ratio of 3, while the results 
in [10] yield a ratio of 80. Moreover, for T = 1 our algorithms are the first 
competitive ones, since the bounds of [10] cannot be applied. The paper [10] 
also shows that there is a metric space in which no deterministic algorithm can 
achieve a constant competitive ratio. We show that this results is already true 
on the real line. 

Our contribution. The contributions of this paper are twofold. First, we pro- 
vide complexity results for the offline problem OFFLINE-WHAM. We derive a 
dynamic program for the OFFLINE-WHAM on unweighted graphs with integral 
release times and deadlines, which runs in time 0(nm(T + m)(A + 1) 2T ), where 
n is the number of nodes in the space, m denotes the number moles and A is 
the maximum degree. The algorithm runs in polynomial time, if (A + 1) 2T is 
bounded by a polynomial in the input size. We complement our solvability result 
by NP-hardness results for some special cases of the OFFLINE- wham. 

Our main contribution lies in the analysis of the online problem wham. We 
show that no deterministic algorithm for the wham on the line can achieve 
a constant competitive ratio. From the viewpoint of the whack-a-mole player, 
the situation is much better on the uniform metric space. Our results for this 
case are summarized in Table 1. In particular, we provide the first competitive 
algorithm for short popup durations T = 1 and substantially decrease the known 
competitive ratio for T > 1. Our competitiveness results are complemented by 
lower bounds on the competitive ratio of arbitrary deterministic algorithms. 
The upper bounds hold against the most powerful adversary, whereas the lower 
bounds are shown against the non-abusive adversary [14], which is the most 
restricted adversary that we consider. 



2 The Complexity of Offline Whack-a-Mole 

In this section we investigate the complexity of the offline problem OFFLINE- 
WHAM where all moles and their respective release dates are known in advance. 
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We first give a polynomial-time algorithm for a special class of the problem. 
Then, we show that offline- WHAM is NP-lrard on the line and on the star 
graph. In this section we slightly diverge from the notation used for the online 
problem in allowing more general deadlines dj > tj for the requests than just 
dj = tj + T, where T is the popup duration. In this more general context T will 
denote the maximum popup duration of any mole. 



2.1 When Whacking Is Easy 

We consider the following scenario: The metric space M = (X, d) has n points 
and is induced by an undirected unweighted graph G = (V, E) with V = X, i.e., 
for each pair of points from the metric space M we have that d{x, y) equals the 
shortest path length in G between vertices x and y. We also assume that for 
each mole the release date tj > 1 and the deadline dj are integers. 

Theorem 1. Suppose that the metric space is induced by an unweighted graph 
of maximum degree A. Then, the OFFLINE-WHAM with integral tj and dj can be 
solved in time 0(nm(T + m)(A + 1) 2T ). where T := maxi<j< m (dj — tj) is the 
longest time a mole stays above the ground; n denotes the number of vertices in 
the graph and m is the number of requests. 

Proof. The time bound claimed is achieved by a simple dynamic programming 
algorithm. Let 0 < t\ < t 2 < • • • < tk with k < m be the (distinct) times where 
moles show up. We set to := 0. 

The idea for a dynamic programming algorithm is the following: For each 
relevant point t in time and each vertex v £ V we compute the maximum 
number of moles caught subject to the constraint that at time t we end up at v. 
Essentially the only issue in the design of the algorithm is how one keeps track of 
moles that have been whacked “on the way” . The key observation is that for any 
time t that we consider the only moles that need to be accounted for carefully 
are those ones that have popped up in the time interval [f — T,t]. Any mole that 
popped up before time t — T will have disappeared at time t anyway. This allows 
us to use a limited memory of the past. 

Given a vertex v, a history track is a sequence s = (v\,V 2 ,...,Vk = v) 
of vertices in G such that for i = 1 ,...,k we have d(vi,Vi+ 1 ) = 1 whenever 
Vi ^ Vj+i. We define the time-span of the history track s to be d(s) = k. 
The history track s encodes a route of starting at vertex v± at some time t, 
walking along edges of the graph and ending up at v at time t + d(s) with the 
interpretation if Vi = we remain at vertex V; t for a unit of time. Notice that 
in an unweighted graph with maximum degree at most A, there are at most 
(A + 1) L history tracks of length LeN ending at a specific vertex v. 

Given the concept of a history track, the dynamic programming algorithm 
is straightforward. For t £ {to, ■ ■ ■ ,tk}, v £ V and all history tracks s, with 
d(s) = min (t,T), ending in v at time t, we define M[t,v,s\ to be the maximum 
number of moles hit in any solution that starts in the origin at time 0, ends in v 
at time t, and follows the history track s for the last d(s) units of time. 
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The values M[0, v, s] are all zero, since no mole raises its head before time 1. 
Given all the values M[t, v, s] for all t = to, . . . , tj- 1 , we can compute each value 
M[tj,v,s] easily. 

Assume that tj < tj-i+T. Then, from the history track s we can determine 
a vertex v' such that v' must have been at vertex v' a time tj- 1 . This task can 
be achieved in time 0(T) by backtracking s. The value M[tj,v,s) can now be 
computed from the 0((A + 1) T ) values M[tj-\,v',s'} by adding the number of 
moles whacked and subtracting the number of moles accounted for twice. The 
latter task is easy to achieve in time 0(T + m) given the history tracks s and s'. 
Hence, the time needed to compute M[tj,v, s] is 0((T + m)(A + 1) T ). 

It remains to treat the case that tj > tj-i+T. Let t := tj-i+T. Notice that 
no mole can be reached after time t and before time tj, since all moles released 
no later then tj- \ will have disappeared by time t. Any solution that ends up at 
vertex v at time tj must have been at some vertex v' at time t. We first compute 
the “auxiliary values” M[t,v',s'] for all v' £ V and all history tracks s by the 
method outlined in the previous paragraph. Then, the value M[tj,v,s) can be 
derived as the maximum over all values M[t,v' , s'], where the maximum ranges 
over all vertices v' such that v can be reached by time tj given that we are at v' 
at time t and given the histories s and s' (which must coincide in the relevant 
part) . 

Since the dynamic programming table has 0(nm(A + 1) T ) entries, the total 
time complexity of the algorithms is in 0(nm(T + m)(A + 1) 2T ). □ 

The above dynamic program can easily be adjusted for metric spaces induced 
by weighted graphs with integral edge weights. Each edge e is then replaced by 
a path of w(e) vertices, where w(e ) denotes the length of edge e. The time 
bound for the above procedure becomes then 0(nm(T + m)(A + 1) 2T ), where 
n = n + ““ !)• Hence, whenever ( A + 1) 2T is pseudo-polynomially 

bounded, OFFLINE-WHAM can be solved in pseudo-polynomial time on these 
weighted graphs. 

2.2 When Whacking Is Hard 

It follows from Theorem 1 that OFFLINE-WHAM can be solved in polynomial 
time if ( A + 1) 2T is bounded by a polynomial in the input size. On the other 
hand, the problem on a graph with unit edge weights, all release times zero and 
all deadlines equal to n, the number of holes, contains the Hamiltonian Path 
Problem as a special case. Thus, it is NP-lrard to solve, see e.g. [15]. 

Another special case of the OFFLINE-WHAM is obtained when at most one 
mole is in a hole at a time, the metric space is the line and release dates as well 
as deadlines are general. Then this problem is also NP-hard by a reduction from 
partition, as mentioned in [6]. 

The first case we consider is the weighted version of OFFLINE-WHAM on the 
line where multiple moles remain above ground for a fixed time. 

Theorem 2. offline- WHAM on the line is NP-hard even if all moles stay above 
ground is equal for all moles, i.e., di — U = dj — tj for all requests ri,rj. 
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Proof. We show the theorem by a reduction from partition, which is well known 
to be NP-complete to solve [12, 8]. An instance of partition consists of n items 
at £ Z + , i = 1, . . , , n, with a* = 2 B. The question is whether there exists a 
subset S C {1, . . . , n}, such that = -S- 

Given an instance of partition, we construct an instance / W ham for OFFLINE- 
WHAM, with m = 3n requests. Let B = an( i K = B + 1. The time 

each mole stays above ground is T = 2 B. There are 2m requests rf and r~ , 
i = 1, . . . , m where rf is released at time (2i— 1 ) K and has deadline (2i—l)K+T. 
The position of rf is K+ai with weight K+ai, and the position of r~ equals —K 
with weight K. Finally, there are m requests rf in the origin, where is released 
at time 2iK, has deadline 2iK + T, and weight K . 

We claim that at least 2riK + B moles can be whacked if and only if I is a 
YES-instance for PARTITION. 

Let S' be a partition of /, i.e. , Yfics a i = B. Then whacking the moles of 
requests in the order (rf 1 , rf . . . , r“ , rf), where an = + if i £ S and on = — if 
i & S, is feasible and yields the desired bound, as tedious computation can show. 

Suppose conversely that there exists a route for the whacker such that it 
reaches at least 2nK + B moles. Notice that as the locations of the holes of 
requests rf and r~ are at least 2K > 2 B apart, the whacker can whack at 
most one of these requests. The moles of requests rf and r~ pop up after time 
tf_ 1 + T, and therefore the whacker cannot catch the moles of request rf_ l and 
rf at the same time. The same is true for requests r° i _ 1 and rf Suppose the 
whacker moves to the hole of rf or r~ after first whacking the moles of rf The 
earliest possible arrival time in the mole is at least 2iK + K = (2 i + 1 )K and by 
this time the moles of rf and r~ have gone down again. Hence, when whacking 
rf and either rf or r~ , the request rf or r~ need to be whacked before rf Not 
whacking the moles of rf or none of rf and r~ , results in a tour in which at 
most (2 n — 1 )K + 2 B < 2nK + B can be caught. Therefore, the whacker needs 
to reach all moles popping up in the origin and for each i it also needs to whack 
all moles of either rf or r~ . Hence, by the above considerations we know that 
when at least 2nK + B moles are whacked, the whacker needs to hit first the 
moles of rf or r~ and then those of rf before going to the hole of request rf +l 

or r i+i- 

Let S = {i : moles of rf are whacked} be the set of requests served in the 
positive part of the line. We claim that XagS ai = Obviously a * — & 
since the number of moles whacked is at least 2nK + B. Suppose that a » > 
B and let S' C S be the smallest subset of S such that if i,j £ S with i < j and 
j £ S' then i £ S' and f2 ie g> a?: > B and let k = max S'. Then YfieS’\{k} ai — 
The whacker leaves the origin for request rf at time 2(k—l)K+2 Yf ieS'\{k} a » — 
2 (k — 1 )K + 2 B < tf The next time the whacker reaches the origin is 2kK + 
ai > + 2 B and by then the moles of request r° have gone under 

ground. Hence, it cannot reach the moles of request r° and is not able to whack 
2nK + B moles. □ 

Our next hardness result concerns the case of a star graph. 
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Theorem 3. OFFLINE-WHAM is NP-hard on a star graph with arbitrary edge 
lengths. This result holds, even if at any moment in time at most one mole is 
peeking out of hole. 

Proof. Let (oq, . . . , a n ) be an instance for partition and let B = | ]>A aq. We 
construct a star graph with n + 1 leaves. The length of the center to leaf i is 
equal to a*, for * = 1, . . . , n and the length of the (n + l)st leg is equal to 1. The 
request sequence is as follows. At time t = 0, a single mole is released in each of 
the leaves j = 1, . . . , n. The popup duration of each mole is equal to 4 B + 2. At 
time 2 B + 1 a mole appears in leaf n + 1 with zero popup duration. Finally, at 
time 4 B + 2 a mole pops up in the center, also with zero popup duration. 

If we have a YES-instance of partition, i.e., there exists a subset S C 
{l,...,n} such that J2 iG s a i = the whacker can catch all moles on the 
following route. First she whacks the moles in the leaves i G S in an arbitrary 
order and arrives at the center at time 2 B. Then, she whacks the mole in leaf 
n + 1 after which all the remaining leaves can be visited and the whacker returns 
in the center at time 2 + JA a,:, at which she can whack the mole that pops up 
there. 

If there exists a route in which the whacker can reach all moles, then by 
time 2 B + 1 she needs to be in leaf n + 1. Therefore, no later than time 2 B 
she needs to be in the center. Let S be the set of moles whacked before this 
time. After whacking the mole in leaf n + 1 and returning to the origin, there is 
still 2 ]>A a,; — 2 B time left to whack all the unwlracked moles and return in the 
center. Therefore, it must hold that ^A gg a i~ B. □ 

3 Whack-a-Mole on the Line 

In this and the following section we investigate the existence of competitive al- 
gorithms for the WHAM. Our results are not only established for the standard 
adversary, the optimal offline algorithm, but also for the more restricted adver- 
saries of [4, 14] , which we recall briefly. 

The optimal offline algorithm is often considered as an adversary, that spec- 
ifies the request sequence in a way that the online algorithm performs badly. 
Besides the ordinary adversary that has unlimited power, there exist several 
adversaries in the literature that are restricted in their power. The fair adver- 
sary, introduced by [4] , may move at any time only within the convex hull of all 
requests released so far. An even more restricted adversary is the non-abusive 
adversary of [14]. This adversary is defined on the line, and it may only move 
into a certain direction if there is still a pending request in this direction. For 
WHAM we extend this definition by restriction that the adversary may only move 
in the direction of a request that it can reach before the deadline of this request. 
A natural extension to other metrics is an adversary that may only move on a 
direct path to a pending request, which deadline can be met. 

Theorem 4. Let T > 0 be arbitrary. No deterministic online algorithm can 
achieve a constant competitive ratio for WHAMy on the line. 
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Proof. Consider an online algorithm ALG and assume w.l.o.g. that its position 
at time T is Palg(T) < 0. The sequence consists of one mole, which pops up at 
time T at position T + 1. As the adversary can be at position T by time T, it 
has killed the mole by time T + 1. ALG, on the other hand, can not be in T + 1 
before time 2 T + 1 and by then the mole has gone under ground again. □ 

In the proof above the adversary abuses its power in the sense that it moves 
to a point where a mole will pop up without revealing this information to the 
online whacker. Theorem 4 can be extended to both, the fair and the non-abusive 
adversary as is shown in the following theorem. 

Theorem 5. Let T > 0 be arbitrary. No deterministic online algorithm can 
achieve a constant competitive ratio for the whaMt on the line against a non- 
abusive adversary. 

Proof. Consider an arbitrary deterministic online algorithm. At time 0 two moles 
appear: one in T and the other in —T, both going down at time T. If the online 
whacker does not reach any mole by time T then the sequence stops. Otherwise, 
we assume w.l.o.g. that the online whacker is in position p ALG (T) = —T at time T. 
ADV is at that time in position T. Then, from time t = T onwards a request 
is given in t + 1 at each integral time T, T+l,T+2,.... The adversary can 
whack all these moles, whereas the online whacker is not able to meet any of the 
deadlines. □ 

4 Whack-a-Mole on the Uniform Metric Space 

Recall that the uniform metric space is induced by a complete graph with unit 
edge weights. The number of vertices in this graph is n. Observe that for T < 1 
trivially no deterministic algorithm can be competitive against the standard or 
fair adversary. In case of the non-abusive adversary, the situation is trivial again, 
since the adversary can never catch any mole except for those in the origin. 



4.1 How Well We Can’t Whack 

We remark that a lower bound of 2 for the wham on the uniform metric space 
has been derived in [10]. Our construction uses fewer nodes in the metric space 
and, more important, there is no positive amount of time where more than one 
request is available at a single hole. Also, note that the lower bounds are shown 
against the most restricted adversary of those defined in the previous section. 

Theorem 6. Let n > 3 T + 2, that is, T < (n — 2)/3. No deterministic online 
algorithm for the WHAMt can achieve a competitive ratio smaller than 2 against 
a non-abusive adversary. 

Proof. At each time t = 0, . . . ,T — 1 the adversary releases two moles: one in 
p t , i = 2f + 1 and the other in p t ^ = 2f + 2. At time t = T, T + 1, . . . three moles 
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are released: two moles are released in empty holes pt;i and p t p and the third 
mole, either, in p t , 3 = Pt.-T , 2 if ALG is in pt.-TA at time t, or, in p t> 3 = pt-T, 1 , 
otherwise. Note that at time t , at most 3 T moles have deadline at least t, and 
as n > 3 T + 2, there are at least two holes left with no moles at time t. 

ALG cannot whack more than one mole per time unit, whereas ADV can kill 
two moles at a time from time t = T onwards. □ 

In the sequel the maximum number of moles available at a single hole will 
play a crucial role. We define N to be the maximum number of moles peeking out 
of the same hole for a positive amount of time. Notice that with this definition, 
there might be a moment in time, where in fact 2N moles are above ground at 
the same place. Consequently, an algorithm might whack up to 2 N moles in a 
single step. 

Theorem 7. If at most N moles can be in the same hole for a positive amount 
of time, then any deterministic online algorithm for the WHAM! against a non- 
abusive adversary has a competitive ratio no less than 2 N. 

Proof. After an initial step, a non-abusive adversary constructs a sequence con- 
sisting of phases such that in each phase it whacks at least 2 N times as many 
moles as ALG does. Each phase starts at a time t when the adversary arrives in 
a hole. We denote by t' the latest deadline of the moles that are in this hole at 
time t. Note that t <t' < t + 1, since the popup duration is 1. There are two 
possible positions for ALG to be at time t: 

Case (a) ALG is in a vertex point different from the position of ADV; 

Case (b) ALG is on an edge. 

Moreover, if there are at the beginning of the phase some pending requests 
released before time t then ALG cannot reach any of them. 

In Case (a), two moles are released at time t in holes where neither ALG nor 
ADV are. If ALG does not immediately go to one of these moles, it cannot whack 
any of them, whereas the adversary catches one of these moles. Otherwise, at 
time t — max{f', t + 1/2} the adversary releases N moles in his current position 
and N moles in a hole v that is not incident to the edge on which ALG is. Thus, 
ALG cannot whack any of them. Hence, it whacks at most one mole, whereas ADV 
reaches 2 TV moles by remaining in his position until time t and then moving to v. 

In Case (b), ALG is in the interior of an edge and thus, it cannot reach any 
vertex point which is not incident to this edge by time t + 1. The adversary 
releases one mole in a free hole, i.e., a vertex point where no mole is and which 
is not incident to the edge on which ALG is. Hence, ALG does not whack any 
mole, and ADV hits one mole. 

An initial sequence consisting of two requests in two different holes each 
releasing a single mole ensures that we end up either in Case (a) or Case (b). 
This completes the proof. □ 

Note that in the proof of the above lower bound we use the fact that the 
release dates may be non-integral. As we see in the next section, this restriction 
is essential, because for integral release dates we are able to show that there 
exists a 2-competitive algorithm. 
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4.2 How Well We Can Whack 

In this section we propose simple algorithms for wham and give performance 
guarantees for the online problem on a uniform metric space. 

First Come First Kill (fcfk) 

At any time t, move to a hole which contains a request with earliest 
release date, breaking ties in favor of the point where the most moles are 
above ground. If none of the moles that are above ground can be killed 
by the algorithm, then the whacker does not move. 

Theorem 8. Let T > 1. Consider the WHAMj- with at most N moles peeking 
out of one hole for a positive amount of time, fcfk is 2N -competitive, which is 
tight. 

Proof. Partition the input sequence into maximal subsequences, such that each 
subsequence consists of requests that FCFK serves continously, i.e. , it is constantly 
moving between holes. We show that OPT whacks at most 2 N times as many 
moles as FCFK does for each subsequence from which the theorem follows. 

Consider such a subsequence a'. We denote by C/ LG the time where algo- 
rithm ALG whacks request r y - . If rj is not caught, then we set C/ LG = oo. Define 

imax = max{ Cf T : rj £ a' and C° FT < oo }. 

We define t m - m such that f max — t m - ln is integral and min j £(T / tj < t m i n < 
min j ecr i tj + 1. In each interval (t,t+ 1] for t = t m i n , . . . , f max — 1, FCFK hits at 
least one mole and OPT cannot whack more than 2N moles. It remains to show 
that the moles which are reached by OPT before t m - m can be compensated for by 
FCFK. 

If fcfk whacks its last mole of a' no later than time f max , then opt catches 
at most N moles in the interval (t max — l,fmax] since no new request can be 
released at f max due to the maximality of the subsequence a'. Moreover, OPT 
can kill at most N moles in the interval (minj £cr / tj, f m i n ]. Therefore, the number 
of moles reached by OPT during the period before f m i n can be accounted for by 
the moles caught in the last interval by OPT and thus, sum up to at most 2N. 

On the other hand, if fcfk still whacks a mole from a' after time f max , the 
number of moles caught by OPT during the first period is at most N times the 
number of moles hit by FCFK after f max . 

Lemma 1 implies that the competitive ratio of 2N is tight for fcfk. □ 

Lemma 1. Let T > 1 he an integer. Consider the WHAM-r with at most N moles 
peeking out of one hole for a positive amount of time. FCFK has no competitive 
ratio less than 2 N against a non-abusive adversary. 

Proof. At time t — 0, the adversary releases T requests in holes 1, . . . , T, each 
of them with weight 1. At time t = 1/2, in hole 2 T + 1, a request is released 
with N moles. At time t, for t = 1, . . . , T — 1, one request in hole T + t is given 
with one mole and at time t+1/2 a request with N moles is given in 2 T + 1 + 1. 
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hole 

T T + 2 2 T 2T + 2 3T + 1 4 T 




Fig. 1 . Lower bound sequence for fcfk. Each request is represented by a vertical line 
between the release date and deadline. Thick lines illustrate requests with N moles, 
the thin lines depict requests with single moles. The line segment is dashed after the 
request has been served by ADV, and dash-dotted after being served by fcfk and ADV. 
Notice that from time T onwards, fcfk serves all its requests by their deadlines. 



At time t, for t = T,T + 1, . . ., one mole is popping up in hole 1 + (T + t — 1) 
mod 2 T . And at time t + 1/2 two requests are given, each with N moles: one in 
2T + 1 + (t mod T) and one in 3T + 1 + (t mod T). This sequence is visualized 
in Figure 1. 

Up to time T, FCFK whacks the moles released at time 0. After time T it 
moves to the hole with the earliest released request that it can reach. As the 
requests with N moles are released 1/2 time unit later than the requests with 
a single mole, FCFK is not able to whack any of the higher weighted requests. 
Hence, it catches in each unit length time interval one mole. In each unit length 
interval from time T + 1/2 onwards, there is one hole where a request with N 
moles has its deadline and a new request with N moles is released. Hence, ADV 
can whack 2N moles in every unit length time interval after time T. □ 

Recall that no deterministic online algorithm can be better than 2-competitive 
(Theorem 6). Hence, by Theorem 8 we know that FCFK is optimal in the un- 
weighted setting, i.e. , at most one mole can be peeking out of a single hole for a 
positive amount of time. Also, in the weighted case with T = 1, FCFK is optimal 
by Theorem 7. For the special case of integral release dates and T = 1, FCFK 
obtains a competitive ratio of 2 even in the weighted setting. 
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Theorem 9. Consider the WHAMj where each mole stays T = 1 time unit above 
ground, fcfk is 2-competitive if all release dates are integral. 

Proof. Due to the integral release elates, both, opt and fcfk are in holes at 
integral points in time. Moreover, OPT serves at most two requests released at 
the same time because of the unit popup duration. FCFK on the other hand, 
whacks at least the moles of one request released at a certain time and by 
definition it chooses the request with the highest number of moles. Therefore, it 
reaches at least half of the moles whacked by the optimal offline algorithm. □ 

Obviously, fcfk’s flaw lies in ignoring all requests with a later deadline even 
though they could contribute with a higher weight to the objective value. In order 
to overcome this drawback we consider an other algorithm which we call Ignore 
and Whack the Most (iwtm). In this algorithm, we divide the time horizon in 
intervals of length l = [?rj, and we denote these intervals by J* = ((i — 1)1, il), 
for i = 0, 1, 2, . . . , L, where II is the last interval in which moles are whacked. 
We say that at time t, the current interval is the interval Ii for which t € Ii. 
Note that these intervals only have a positive length for T > 2. 

When formulating the algorithm iwtm we allow the algorithm to whack only 
a subset of the moles available at a certain hole. Although our problem definition 
would force all moles at v to be whacked, this condition can be enforced within 
the algorithm by keeping a “virtual scenario” . 

Ignore and Whack the Most (iwtm) 

At any time when the whacker is at a hole, it moves to the hole with 
the highest number of pending moles released before the current interval 
that can still be reached in time. Only those moles will be whacked at 
the hole. 

Theorem 10. Let T > 2 and c = ■ iwtm is c-competitive for the 

WHAM T . 

Proof. Let ki denote the number of moles released in interval whacked by OPT, 
and let hi denote the number of moles whacked by IWTM during interval /j. Then 

OPt(ct) = ^2 ki, and iwtm(cj) = ^ hi. (1) 

i i 

Moreover, since no moles are released in the last interval II , it follows that 

k L = 0* 

First note that IWTM is at integral time points always in a hole. Therefore, 
during interval /j+i it can visit l holes. If it visits less than l holes, then the 
number of requests released in interval is less than l. Hence, OPT cannot kill 
more than hi+i moles of those released in A. 

Conversely, suppose that IWTM visits exactly l holes during I i+ 1 . The op- 
timum can visit at most \l + T] holes of requests released in interval I t . By 
definition iwtm serves the l holes with the highest weight of pending requests 
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released at or before time il. Therefore, hi+± > (l/\l + Hence, by Equa- 

tions (1), we know that 

iwtm(ct) > (l/\l + T])0PT{a). 

Recall that l = \T/2\. For T ranging from 2 to 20, the values of the competitive 
ratio are depicted in Figure 2. □ 



C 

5 -- 
4 -- 




3 -- ■ 



2 4 6 8 10 12 14 16 18 20 T 

Fig. 2. Competitive ratio for iwtm for T G [2, 20]. 

5 Concluding Remarks 

We have provided the first competitive algorithms for the WHAM on the uniform 
metric space for small popup duration and improved known competitiveness 
results for larger values. For the case of non uniform popup durations a natural 
extension of the FCFK algorithm would be the Whack the Wimp algorithm: 
always move to a hole containing a request with earliest deadline (breaking ties 
just as in FCFK). If all popup durations are at least 1, then using the same 
analysis as for FCFK we can show the same bounds on the uniform metric space. 
In the case that there are popup durations less than 1, no deterministic algorithm 
can be constant competitive. 

Our lower bound results show that the situation on the real line is hopeless 
in terms of competitiveness, at least for deterministic algorithms. This raises the 
question whether randomized algorithms can do better. 
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Abstract. We study the competitiveness of online deadline scheduling 
problems. It is assumed that jobs are non-preemptive and we want to 
maximize, in an online manner, the sum of the length of jobs completed 
before their deadlines. When there is a single machine, Goldwasser [4] 
showed that the optimal deterministic competitiveness of this problem is 
2 + | , where each job of length L can be delayed for at least k ■ L before 
it is started, while still meeting its deadline. 

In this paper we generalize the framework of the above problem in two 
ways: First we replace the adversary by a set of m weak adversaries , 
each of which can generate arbitrary jobs that never overlap each other. 
Assuming that job sequence is generated by the team of m weak adver- 
saries for some fixed m, we present a tight analysis of an optimal online 
algorithm by extending the previous analysis by Goldwasser. Next we 
allow online algorithms to abort the currently running job and to restart 
it from the scratch, if it can meet the deadline. This capability of abort 
and restart is different from preemptiveness because only jobs that are 
scheduled without interrupt are regarded to be successfully scheduled. 
We present a constant competitive algorithm for this model. 

Keywords: Analysis of algorithms; Online algorithm; Real-time schedul- 
ing 



1 Introduction 

Online Admission Control. Imagine there is one resource (communication link, 
CPU, etc.) that services an online sequence of jobs. It is assumed that each job 
has a processing time and a firm deadline by which the job should be completed. 
A job is accepted if it runs non-preemptively (without interrupt) and is completed 
before its deadline. We aim to find a schedule of jobs that maximizes the sum 
of the lengths of accepted jobs. We consider the online problem. 

Model. A job J 3 is defined to be a triple of non-negative integers ( rj,pj,dj ), 
where r 3 is the arrival time, pj is the length of the processing time, and dj is 
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the deadline by which the job should be completed. Let = dj — pj denote the 
expiration time - the last possible time at which Jj could be started while still 
meeting the deadline constraint - and let Sj = dj—pj—Tj denote the slack (called 
laxity in [5]), which is the amount of time between r ; - and the expiration time 
ej. The gain of an algorithm A , denoted Gain(A), is the sum of the length of 
the accepted jobs. The competitive ratio of an online algorithm A is the worst- 
case ratio between Gain(A) and the gain of the optimal algorithm with full 
knowledge of future jobs. An online algorithm is c-competitive if its competitive 
ratio is bounded by c. 

Previous Work. The problem of the present paper has been studied by many 
researchers [8,3,4]. Goldwasser [4] showed that the optimal competitiveness is 
2 + 1, where the slack of each job is at least k times the processing time. Very 
recently, Lee [6] presented an 0(Zog|:)-competitive randomized algorithm for a 
single machine and for multiple machines. There are also researches on several 
variants of this problem such as preemptive scheduling with different objectives 
[1,5] and scheduling with abort [12,7]. 

Team Adversary. Many researches have analyzed online algorithms using the 
competitive analysis [10]. It is assumed that the input of online problem is gener- 
ated by the malicious adversary having unlimited computing power. Sometimes, 
however, the worst-case input request is too severe to the online algorithm and 
rare in practice. Thus, many researches studied various relaxed settings of on- 
line problems by incorporating lookahead, access graphs, stochastic adversary, 
or diffuse adversary (for details, see [2]). 

In this paper we first introduce a quite natural parameterization of the adver- 
sary, called team adversary , which appears to be useful for analyzing the online 
scheduling problems with deadlines. The team adversary consists of m (e Z + ) 
weak adversaries and each weak adversary can generate serial jobs whose avail- 
able intervals between the release time and the deadline are disjoint. Notice that 
the traditional adversary corresponds to the case of m = oo. 

From the practical viewpoint, we can view that the input generated by the m- 
team adversary comes from m different sources, each of which is not overloaded. 

In addition to the analysis of the optimal competitiveness, there is another 
interesting question related with the m - team adversary: what is the smallest 
number m such that the m - team adversary can generate the worst-case input of 
the oo-team adversary (i.e., the traditional adversary)? We give a tight answer 
for this question. 

Restart. We consider the online algorithm with an additional capability. Our 
motivation is that the optimal competitiveness of 2+ 1 /k given in [4] still goes to 
infinity as k goes to 0. We assume that online algorithms can abort the currently- 
running job in favor of better jobs and restart it from scratch later if its deadline 
can be met. Notice that this scheduler is different from the preemptive scheduler 
that can suspend the running job and restart it from the point of suspension. 
Moreover, the optimal algorithm is the same as the optimal algorithm without 
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abort/restart, because abort and restart would not increase the gain of any offline 
algorithm 1 . This model was mentioned in [9] and studied in different settings [7, 
11]. Our framework is a generalization of Woeginger’s [12], in which every job 
has no slack time and thus restart is not considered. In our framework, each job 
has non-zero slack time and can restart. 

Our Results. We show that the optimal competitiveness against the team ad- 
versary of size m is 1 + min[ "V 1 . ^±1], which is a generalization of the optimal 
competitiveness of 1 + for the traditional adversary [4]. Our analysis im- 
plies that the m- team adversary has the same power as the traditional adversary 
(with respect to the competitive analysis) if and only if m is at least k + 2. 

We next present a constant competitive strategy with abort/restart. The 
interesting characteristic of our algorithm is that it is parameterized by c, indi- 
cating reluctance to abort, and the best value of c depends on the value of k. If c 
is large enough, our algorithm never aborts. Consequently, the analysis includes 
the analysis of the algorithm without abort/restart as a special case. 

2 Optimal Competitiveness 

against the m-Team Adversary 

In this section, we describe the optimal competitiveness against the m- team 
adversary. 

Theorem 1 The optimal competitiveness against the m-team adversary is 1 + 

min[™=i,^±!]. 

In the rest of this section, we show Theorem 1. The proof for the second term 
1+ k ^ is exactly same as the proof in [4] but we include it here for completeness. 

Proof of the Lower Bound. We denote a job Jj by ( rj,pj , Sj), where Sj is assumed 
to be at least k ■ pj. Imagine we are the m- team adversary who would like to 
raise the competitive ratio of auy online algorithm. Initially, one unit job J\ with 
arbitrarily large slack is given. As this job may be the only one to arrive, any 
online algorithm must schedule it. Suppose it was scheduled at t by the online 
algorithm. There are two cases. 

m — 1 ^ k -\- 1 

k ^ k ' 

Assume that e (> 0) is arbitrarily small. The m- team adversary generates 
m — 1 copies of a job ( t + e, ^—^,1 — 2e). Since the expiration time of each 
job is earlier than t + 1, the online algorithm cannot schedule any of them. 
However the optimal algorithm can schedule all of them by scheduling J\ 

1 In the non-preemptive scheduling, it is said that once a job begins running, it is 
not interrupted. Interestingly, there are two ways to interpret and implement this 
definition. First, each job may be inherently non-preemptive. Second, only a non- 
preemptive execution of each job is regarded as a success. In the offline scheduling, 
there is no difference between the above two. In the online scheduling, however, there 
is a big difference, as shown in this paper. 
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elsewhere. The competitive ratio is at least 1 + — (m — 1)2 e/k. By taking 

e' = 2(m — l)e/fc, we have that the competitive ratio is at least 1 + — e', 

for any e' (> 0). 

m — 1 \ fc+1 

k — k 

The m - team adversary gives one job ( t + e, 1 — 2e) and m — 2 copies 

of job ( t + e, 1 — 2e). Since the expiration time of each job is t + 1 — e, 

the online algorithm cannot schedule any of them. On the other hand, all 
jobs can be scheduled by the optimal algorithm. Thus the competitiveness 
is at least 1 + fc + 1 — 2(1 + ^)e, for an arbitrarily small e (> 0). By taking 
e' = 2(1 + i)e, we have that the competitive ratio is at least 1 + — e' . 

Proof of the Upper Bound. We say that a job Jj is available at t € [rj, ef] if Jj is 
neither be completed nor be running at t. We consider the online algorithm EEF 
(Earliest Expiration-time First) that schedules the available job with the 
earliest expiration time. We show that the competitive ratio of EEF is no greater 
than both of 1 + m “ 1 and 1 + — . 

k k 

As in [3], we consider the schedule a produced by EEF, and call the periods 
during which the resource is continuously in use the busy periods of a. Label 
these periods as 7 Ti, 7 r 2 , • • • , 717 and let bi and e*, respectively, denote the times at 
which 7 r,; begins and ends. Partition the job sequence S into classes Si, S 2 , • • ■ , <5;, 
where Si consists of exactly those jobs that arrived during the period [bi, ef). In 
order to prove that EEF is c-competitive, it suffices to prove this result over 
subsequence Si. Without loss of generality, we can artificially push back the 
release times of all other jobs to be later than the latest deadline of a job in Si. 
This change will have no effect on the gain for EEF and the additional time can 
only help the optimal offline algorithm. Therefore, from the point on we assume 
that our instances are such that the EEF schedule results in a single busy period. 
Let [0 ,t) be this single busy period. 

Let Zj denote the set of jobs generated by the i-th adversary. Suppose that 
jobs in Ai (C I,) are scheduled by EEF and those in B t (C I,) are scheduled by 
the optimal algorithm OPT but not scheduled by EEF. Let ai and b t , respectively, 
denote the sum of the lengths of jobs in Ai and Bi. Then we have the following 
lemma. 

Lemma 2 (a) t bi < t{ 1 + £). (6) b t < t -^~. 

Proof: (a) We first note that OPT cannot start to schedule any job in Bi after 
t, because otherwise, EEF can also schedule it after t, which contradicts the 
definition of t. Next, it is easily seen that all jobs in Ai and B, arrive after 0 
because EEF schedules a job at 0 for the first time. Finally, we claim that every 
job in Bi is shorter than f. Indeed otherwise, the slack of such a job is at least 
t and thus it must be available after t, so EEF will schedule at least one job after 
t, which is a contradiction. Thus OPT starts to schedule jobs in Bi during [0, t) 
and the last scheduled job can be no larger than |. Therefore JT bi is at most 
t(l + jr). 

(b) We fix i. Let AiUBi = {J ix , , • • • ,Ji }, where < r* 2 < • • • < r* . Recall 

that Ai is the set of jobs scheduled by EEF and any job in Bi is not available 
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at the time that some job in A; is scheduled, because they are generated by the 
same (i-th) adversary. It is easily seen that > 0 and r* > Y^iZi Pi (1 + k). 
Moreover, the expiration time e, is at least r,; +pi -k. So the last job’s expiration 
time is e* > Ya=i Pi + ( a * + &») ' Assume that the last job belongs to Bi. 
Then l Pi — a i ■ ^ this lemma does not hold (i.e., bi > ^f-), 

n— 1 

e* > ^2 Pi + (a* + h) - k 

l=i 

> ^ + bi ■ k 

> CLi + t — CLi 

= t 

Since the expiration time of the last job in Bi is later than t, this job is available 
after t and thus the online algorithm EEF must schedule it after t (or other jobs 
in Bi instead), which contradicts the definition of t. 

Now assume that the last job belongs to A*. Then Ym=i Pi > bi. The exe- 
cution of this job cannot end before r'i + Pi . If this lemma does not hold (i.e., 
b > t ~ a ) 



n— 1 

n +Pi > ^2 Pi (l + &) + Pi 

i=i 

n — 1 

= ^ + bi + ^2 Pi ’ k 
i=i 

(i% T bi -\ b ; ■ k 

> CLi + bi + t — CLi 

> t 

We have that the last job, scheduled by EEF, cannot finish before t, which is a 
contradiction to the definition of t. Therefore, this lemma holds. EQ 

The gain of the optimal algorithm is at most JA a i + Yli bi and the gain of 
the online algorithm is t. The competitive ratio of EEF is ^ a b m Plugging 
the bound of Lemma 2a and noting t = a,, we have that the competitiveness 

is at most 1 + Afyb Plugging the bound of Lemma 2b, we have that 

a i + J2i bj < Yli a i + ~i~ 

t ~ t 

_ 

t 



completing the proof of Theorem 1. 




Online Deadline Scheduling: Team Adversary and Restart 211 



3 Online Algorithm with Abort and Restart 



In this section, we consider an online algorithm with an additional capability. 
That is, an online algorithm is allowed to abort the current running job in favor of 
better jobs and restart it from scratch later (if the deadline does not expire) . We 
emphasize that this scheduler is different from the preemptive scheduler that can 
suspend the running job and resume it from the point of suspension. Moreover, 
the optimal algorithm is same as in the previous section, because abort and 
restart would not increase the gain of any offline algorithm. 

We introduce a variant of EEF with the parameter c (> 1), called EEF-c. Let 
|| • ]| denote a length of a job and let A denote the set of available jobs that 
are neither scheduled nor expired so far. Whenever the resource is idle, EEF-c 
first chooses a job Jj by EEF. The only one difference from EEF is that during 
the execution of the current job J j, if there is an available job Ji such that 
1 1 Ji 1 1 > c • 1 1 Jj || and Ji will expire before the completion of the current execution 
of Jj. EEF-c aborts Jj and schedules Ji. The aborted job Jj is put into A if it 
does not expire. Observe that if c is sufficiently large EEF-c is same as EEF. 



Theorem 3 The competitive ratio of EEF-c, against the m-team adversary, is 
bounded by the followings, for any m £ Z + and c (> 1): 



<( 



max(l — ck, 0) 
c — 1 



777 — 1 

1)( — ^ — ) + 1 fork> 0 



(1) 



„ ,max(l — cfc,0) + 

<( 1 +1)(— jp) + l for k > 0 



< m °(l~ cM > +1 + C+1 for k > 0 

c — 1 



(2) 

( 3 ) 



Proof: Using the arguments in the proof of Theorem 1, we can assume that our 
instances are such that the EEF-c schedule results in a single busy period [0,i). 
That is, the resource is continuously in use during [0, f) by EEF-c and it is idle 
right after t. 

It is possible that one job is aborted several times and accepted later. Through- 
out this proof, a task refers to an (possibly partial) execution instance of a job. 
A job is accepted if one of its tasks is accepted. By the algorithm EEF-c, at most 
one task is accepted for each job. Thus the sum of the lengths of the accepted 
tasks is that of the accepted jobs. 

Since a scheduled task may be aborted, the gain of EEF-c (over [0, i)) may 
be smaller than t. Let us first bound the gain of EEF-c from below. Assume 
that J tl , Jj , 2 , • • •, Ji are accepted tasks and each of Ji (1 < k < l) completes 
its execution during [a,; , b, ) . Let j!f , Jj 2 , • • • , Jj' ( denote the tasks partially 
scheduled in this order during [b, _ 1 ,aj ) and aborted before completion. We 
claim that 



- h < 



max(l — ck, 0) 



(h - ai ) 



c — 1 



( 4 ) 
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To show this claim, we first fix our attention to two tasks j !^ ( and Ji . The 
former was aborted at at in order to schedule the latter. From the definition of 
EEF-c, we have \\J t \\> c- \\J* ,\\- Similarly, \\Jj +1 \\ > c- \\Jj\\ (1 < r < k'). 
Thus 

Ell^lK^flW II 

r = 1 

Again consider J* t and J % . Let ty denote the time at which J* t is partially 
scheduled. If Ji is available at ty, J^ t never runs (in other words, ty = ai ) 
and so it contributes nothing to the left part of (4) and can be ignored. Assume 
otherwise, that is, J t arrives after ty and J* t is executed for some time. Since 
J* , is eventually aborted, J t expires before Jj , completes its execution. Since 
the slack of J t is at least k ■ 1 1 Ji \ | > ck \ \ j!f , 1 1 , the length of the partial execution 
of jf , is at most \\Jj , \ \ — ck\\j!f , || = (1 — ck)\\Jj t \ |. Plugging this formula 
into (4), we have that 

at - hi < max(l - ck,0) ■ } \\jf < - J7i (5) 

z — / j c — 1 

r= 1 

Summing the equation (5) over all accepted jobs Ji 1 , J7, 2 , • • • , Ji , we have 

( 6 ) 



Clearly the gain of EEF-c is l II & II- 

How much is the gain of the optimal algorithm? The optimal algorithm Dpt, 
at best, can fill the busy period [0, t), start a new job right before t and schedule 
the jobs accepted by EEF-c outside [0, t). By applying the arguments in the proof 
of Theorem 1, we have the the following upper bound on the gain of Opt: 



, . , an — 1 k T 1 , v — > I, 

Gain(Dpt) < mm( — - — , — - — )t+} J|j7i || 

rv K 

r= 1 

. . .m— 1 k + 1 . max(l — ck, 0) 

< H— — K 



i) + i]En^ 



r . .to— 1 k + 1. ,max(l — ck,0) _ . . . 

= min( — - — , — — )( ^ + 1) + 1 • Gazn(EEF-c) 

k k c — 1 



completing the proof of (1) and (2). 

It remains to show (3). Recall that the optimal algorithm, at best, can fill 
the busy period [0, £), start a new job J right before t and schedule the jobs 
accepted by EEF-c outside [0 ,£). It is easily seen that ||Jj| < c- \\J t ||. Indeed 
otherwise, EEF-c either aborts Ji and schedules J instead (if J expires before 
t) or schedules J (possibly other jobs instead) after t (if J does not expire 
before t), which is a contradiction of the definition of Ji and [0, t), respectively. 
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Therefore we have the the following upper bound on the gain of Opt: 

i 

Gain( Opt) < t + c ■ || Jj || + £iw ii 

fc= 1 

,max(l — cfc,0) „ . 

< ( + 1 + c + 1 • Gatn(EEF-c) 

c — 1 

completing the proof of this theorem. 0 

How good is the bound in Theorem 3? First of all, the competitive ratio 
is constant for any value k (> 0). If we let c = 2, the bound in (3) becomes 
4+1 — ck (< 5). The best competitive ratio of EEF-c depends on the choice 
of c, which again depends on the value of k. Clearly the best competitive ratio 
of EEF-c is no greater than that of EEF. It is more difficult to prove the lower 
bound for the online algorithm with abort and restart, because the behavior of 
the online algorithm is more complex. However the bound in Theorem 3 is not 
far from the lower bound because the lower bound of 4 is known in [12] for the 
special case of this problem where all jobs have no slack time. 

References 

1. S. Baruah, J. Harita, and X. Sharma. Online scheduling to maximize task com- 
pletions. In Proc. of IEEE Real-Time Systems Symposium, pages 228-237, 1994. 

2. A. Borodin and R. El-Yaniv. Online computation and competitive analysis. Cam- 
bridge University Press, 1998. 

3. S. Goldman, J. Parwatikar, and S. Suri. On-line scheduling with hard deadlines. 
Journal of Algorith, 34(2):370-389, 2000. 

4. M. Goldwasser. Patience is a virtue: the effect of slack on competitiveness for 
admission control. In Proc. of 10th ACM-SIAM SODA, pages 396-405, 1999. 

5. B. Kalyanasundaram and K. Pruhs. Maximizing jon completion online. In Proc. 
of ESA, pages 235-246, 1998. 

6. Jae-Ha Lee. Online deadline scheduling: multiple machines and randomization. In 
Proc. of SPAA, pages 21-25, 2003. 

7. Jae-Ha Lee and Kyung-Yong Chwa. Online scheduling of parallel communications 
with individual deadlines. In Proc. of ISAAC, pages 383-392, 1999. 

8. R.J. Lipton and A. Tomkins. Online interval scheduling. In Proc. of 5-th ACM 
SODA, pages 302-305, 1994. 

9. J. Sgall. Online scheduling. In Online Algorithms: The State of the Art, eds. A. 
Fiat and G. J. Woeginger, LNCS 1442, Springer Verlag, pages 196-231, 1998. 

10. D. Sleator and R. Tarjan. Amortized efficiency of list update and paging rules. 
Communications of ACM, 2(28):202-208, 1985. 

11. M. van den Akker, H. Hoogeven, and N. Vakhania. Restarts can help in the on-line 
minimization of the maximum delivery time on a single machine. In Proc. of 8th 
ESA, pages 427-436, 2000. 

12. G.J. Woeginger. On-line scheduling of jobs with fixed start and end times. Theo- 
retical Computer Science, 130:5-16, 1994. 




Minimum Sum Multicoloring 
on the Edges of Trees* 
(Extended Abstract) 



Daniel Marx 

Department of Computer Science and Information Theory, 
Budapest University of Technology and Economics 
Budapest Pf. 91, H-1521, Hungary 
dmar x@c s . bme . hu 



Abstract. The edge multicoloring problem is that given a graph G and 
integer demands x(e ) for every edge e, assign a set of x(e) colors to 
vertex e, such that adjacent edges have disjoint sets of colors. In the 
minimum sum edge multicoloring problem the finish time of an edge is 
defined to be the highest color assigned to it. The goal is to minimize 
the sum of the finish times. The main result of the paper is a polynomial 
time approximation scheme for minimum sum multicoloring the edges of 
trees. 



1 Introduction 

In this paper we study an edge multicoloring problem that is motivated by 
applications in scheduling. We are given a graph with an integer demand x(e) for 
each edge e. A multicoloring is an assignment of a set of x(e) colors to each edge 
e such that the colors assigned to adjacent edges are disjoint. In multicoloring 
problems the usual aim is to minimize the total number of different colors used 
in the coloring. However, in this paper a different optimization goal is studied. 
Given a multicoloring, the finish time of an edge is defined to be the highest 
color assigned to it. In the minimum sum multicoloring problem the goal is to 
minimize the sum of finish times. 

An application of edge coloring is to model dedicated scheduling of bipro- 
cessor tasks. The vertices correspond to the processors and each edge e = uv 
corresponds to a job that requires x(e) time units of simultaneous work on the 
two preassigned processors u and v. The colors correspond to the available time 
slots: by assigning x(e) colors to edge e, we select the x(e) time units when the 
job corresponding to e is executed. A processor cannot work on two jobs at the 
same time, this corresponds to the requirement that a color can appear at most 
once on the edges incident to a vertex. The finish time of edge e corresponds to 
the time slot when job e is finished, therefore minimizing the sum of the finish 
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times is the same as minimizing the sum of completion times of the jobs. Using 
the terminology of scheduling theory, we minimize the mean flow time, which is 
a well-studied optimization goal in the scheduling literature. Such biprocessor 
tasks arise when we want to schedule file transfers between processors [2] or 
the mutual diagnostic testing of processors [6] . Note the we allow that a job is 
interrupted and continued later: the set of colors assigned to an edge does not 
have to be consecutive, hence our problem models preemptive scheduling. 

Of particular interest is the case where the graph G is bipartite. A possible 
application of the bipartite problem is the following. One bipartition class corre- 
sponds to a set of clients, the other class corresponds to a set of servers. An edge 
e between two vertices means that the given client has to access the given server 
for x(e) units of time. A client can access only one server at the same time, and 
a server cannot accept connections from more than one client simultaneously. 
Clearly, bipartite edge multicoloring models this situation. 

Minimum sum edge multicoloring is NP-lrard on bipartite graphs even if 
every edge has unit demand [3]. For general demands, [1] gives a 2-approximation 
algorithm. The problem can be solved in polynomial time if every edge has unit 
demand and the graph is a tree [3,7]. 

In this paper, we consider the minimum sum edge coloring problem restricted 
to trees. We show that, unlike in the unit demand case, minimum sum edge col- 
oring is NP-lrard on trees if the demands are allowed to be at most 2. The main 
contribution of the paper is a polynomial time approximation scheme (PTAS) 
for minimum sum edge multicoloring in trees. 

In [4, 5] PTAS is given for the vertex coloring version of the problem in the 
case when the graph is a tree, partial k- tree, or a planar graph. One of their main 
tools is the decomposition of colors into layers of geometrically increasing size. 
This method will be used in this paper as well. However, most of the other tools 
in [4,5] cannot be applied in our case, since those tools assume that the graph 
can be colored with a constant number of colors. If the maximum degree of the 
tree can be arbitrary, then the line graph of the tree can contain arbitrarily large 
cliques. On one hand, these large cliques make the tools developed for partial 
/e-trees impossible or very difficult to apply. On the other hand, a large clique 
helps us in finding an approximate solution: since the sum of a large clique has to 
be very large in every coloring (it grows quadratically in the size of the clique) , 
more errors can be tolerated in an approximate solution, and this gives us more 
elbow space in constructing a good approximation. 

The paper is organized as follows. In Section 2 we introduce some notations 
and give results on the complexity of the problem. Section 3 gives a PTAS for 
trees where the maximum degree of the tree is bounded by a constant, while 
Section 4 gives a linear time PTAS for general trees. 

2 Preliminaries 

The problem considered in this paper is the edge coloring version of minimum 
sum multicoloring, which can be stated formally as follows: 
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Minimum Sum Edge Multicoloring (seme) 

Input: A graph G(V, E) and a demand function x: E — > IN. 

Output: A multicoloring \P: E — > 2^ such that ^(e)! = x(e) for every 
edge e, and 'f'(ei) fi t^^) = 0 if ei and e 2 are adjacent in G. 

Goal: The finish time of edge e in coloring E is the highest color assigned 
to it, /^(e) = max{c € 'Z'(e)}. The goal is to minimize Y2 e eE /^( e )> the 
sum of the coloring \T. 

We extend the notion of finish time to a set E of edges by defining fe(E) = 
Y^eeE Given a graph G and a demand function x(e) on the edges of G, 

the minimum sum that can be obtained is denoted by OPT(G, x). 

Henceforth the graph G is a rooted tree with root r. The root is assumed 
to be a node of degree one, the root edge is the edge incident to r. Every edge 
has an upper node (closer to r) and a lower node (farther from r). Edge / is a 
child edge of edge e if the upper node of / is the same as the lower node of e. In 
this case, edge e is the parent edge of edge /. A node is a leaf node if it has no 
children, and an edge is a leaf edge if its lower node is a leaf node. The subtree 
T e consists of the edge e and the subtree rooted at the lower node of e. 

A bottom up traversal of the edges is an ordering of the edges in such a way 
that every edge appears before its parent edge. It is clear that such ordering 
exists and can be found in linear time. 

Since the tree is bipartite, every node has a parity, which is either 1 or 2, 
and neighboring nodes have different parity. Let the parity of an edge be the 
parity of its upper node. Thus if two edges have the same parity and they have 
a common node v, then v is the upper node of both edges. 

If the tree has maximum degree A, then the edges can be colored with A 
colors. This color will be called the type of the edge. In some of the algorithms, 
the leaf edges are special, they are handled differently, therefore we want to 
assign a type only to the non- leaf edges. Clearly, if every edge has at most D 
non-leaf child edges, then the non-leaf edges can be colored with D + 1 colors, 
and they can be given a type from 1, 2, . . . , D + 1 such that adjacent edges have 
different type. 

The following lemma bounds the number of colors required in a minimum 
sum multicoloring (proof is omitted) . In the following, the maximum demand in 
the instance is denoted by p. 

Lemma 1. If T is a tree with maximum degree A and maximum demand p, then 
every optimum coloring of the seme problem uses at most p(2A — 1) colors. 

If both the maximum degree of the tree and the maximum demand is bounded 
by a constant, then the problem can be solved in linear time. The idea is that 
there are only a constant number of possible color sets that can appear at each 
edge, hence using standard dynamic programming techniques, the optimum col- 
oring can be found during a bottom up traversal of the edges. We omit the 
details. 

Theorem 2. The seme problem for trees can be solved in ■ n time. 
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On the other hand, if only the demand is bounded, then the problem becomes 
NP-complete: 

Theorem 3. Minimum sum edge coloring is NP -complete in trees, even if every 
demand is 1 or 2. 

Using polyhedral techniques, we can show that trees have the following scal- 
ing property (proof is omitted): 

Theorem 4. For every tree T, demand function x and integer q, if x'(e ) = 
q ■ x(e), then OPT(T, x') = q • OPT(T, x) holds. 

In general, Theorem 4 does not hold for every graph, not even for every bipartite 
graph. We will use Theorem 4 to reduce the number of different demand sizes 
that appear in the graph, with only a small increase of the sum: 

Lemma 5. Let (T,x) be an instance of seme and letx'(e) be |_(l + e)*J for the 
smallest i such that [(1 + e)*J > x(e). Then OPT(T, x') < (1 + e) • OPT(T, x). 

As explained in the introduction, our goal is to minimize the sum of finish 
times, not to minimize the number of different colors used. Nevertheless, in 
Theorem 6 we show that the minimum number of colors required for the coloring 
the edges of a tree can be determined in polynomial time. The approximation 
algorithm presented in Section 3 uses this result to solve certain subproblems. 

Theorem 6. Let T be a tree and let C = max,, gym ^2 e3v x(e). Every coloring 
of T uses at least C colors, and one can find in linear time a multicoloring 
F using C colors where each F(e) consists of at most two intervals of colors. 
Moreover, if each x(e) is an integer multiple of some integer q, then we can find 
such a F where the intervals in each F{e) are of the form [qi\ + 1, qi<f\ for some 
integers i\ and ii- 

Proof. It is clear that at least C colors are required in every coloring: there is a 
vertex v such that the edges incident to v require C different colors. The coloring 
F can be constructed by a simple greedy algorithm. Details omitted. □ 

3 Bounded Degree 

If a tree T has maximum degree A, then the line graph of T is a partial ( A — 1)- 
tree. Halldorsson and Kortsarz [4] gave a PTAS with running time n°( k / e 1 for 
minimum sum multicoloring the vertices of partial fc-trees, therefore there is a 
PTAS for seme in bounded degree trees as well. However, the method can be 
made simpler and more efficient in line graphs of trees. In this section we present 
a linear time PTAS for seme in bounded degree trees, which makes use of the 
special structure of trees. Furthermore, our algorithm works even if the degree of 
the tree is not bounded, but we know that every edge has a bounded number of 
non-leaf child edges. Most of the ideas presented in this section are taken from 
[4], with appropriate modifications. In Section 4 a PTAS is given for general 
trees, which uses the result in this section as a subroutine. 
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Fig. 1. The decomposition of the colors into layers (£ = 3) 



3.1 Layers and Zones 

An important idea of the approximation schemes given in [4, 5] is to divide the 
color spectrum into geometrically increasing layers, and to solve the problem 
in these layers separately. We use a similar method for the seme problem in 
bounded degree trees (Theorem 9) and general trees (Theorem 10). 

For some e > 0 and integer £ > 0, the (e,£) -decomposition divides the set 
of colors into layers Lq , and zones Zq , Z i, . . . , Zg. The layers are of ge- 

ometrically increasing size: layer Lj contains the range of colors from q , to 
qi + 1 — 1, where qi = [(1 + e) 1 J (if qt = qi+\ 1 then layer Li is empty). Denote by 
Q\ = \Li\ = q i+ 1 — qi the size of the zth layer. Notice that qi + i < (1 + e)qi. The 
total size of layers Lq, L i, . . . , Li is qi + 1 — 1. 

Each layer is divided into a main block and an extra block. The extra block is 
further divided into extra segments (see Figure 1). Layer Li is divided into two 
parts: the first colors form the main block of layer Li and the remaining 

colors the extra block. The main block of layer Li is denoted by Li. The 
union of the main block of every layer Li is the main zone Zq. Divide the extra 
block of every layer L, into £ equal parts: these are the £ extra segments of Li. 
The union of the jth extra segment of every layer Li forms the jth extra zone 
Zj. Each extra zone contains yyjjQf colors from layer L. h . Rounding problems 
are not discussed in this extended abstract, but they can be handled rigorously. 
We will need the following properties of the defined zones: 

Lemma 7. For given e and £, the (e, ^-decomposition of the colors has the fol- 
lowing properties: 

(a) For every c > 1, there are at least c colors in Zq not greater than [(l + e^)cj. 

(b) For every c > 1 and 1 < j < £, there are at least c colors in Zj not greater 
than [(1 + e) (1 + e£) /e • c] . 

Given a multicoloring F, the operation (e, £) -augmentation creates a multi- 
coloring F in the following way. Consider the (e, ^-decomposition of the colors, 
and if F(e) contains color c, then let <P(e) contain instead the cth color from 
the main zone Zq. By Lemma 7a, f&(e) < [(1 + &Q/i?(e)J, thus this operation 
increases the sum by at most a factor of (1 + e£). After the augmentation, the 
colors of the extra zones are not used, only the colors of the main zone. 
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3.2 PTAS for Bounded Degree Trees 

The polynomial time algorithm of Theorem 2 was based on the observation that 
we have to consider only a constant number of different colorings at each edge if 
both the demand and the maximum degree is bounded. In general, however, the 
number of different color sets that can be assigned to an edge is exponentional 
in the demand. The following lemma shows that we can find a good approximate 
coloring by considering only a restricted type of color sets. 

Lemma 8. For e > 0, define b h j = qt + j\e 2 Q\/ 4] . If each vertex of the tree T 
has at most D non-leaf child edges, then it has an (l+e)(l+(D+l)e)~ approximate 
coloring F with the following properties: 

1. In the (e, D)- decomposition of the colors, if e is a non-leaf edge, then F(e) 
contains colors from the main zone only between ex(e)/4 and 2x(e)/e. 

2. If e is a non-leaf edge of type k, then F(e) contains the first t e colors from 
extra zone Z f (for some t e ), and it does not contain colors from the other 
extra zones. 

3. If e is a leaf edge, then F(e) contains colors only from the main zone. 

4- If e is a non-leaf edge, then F(e ) contains at most two continuous intervals 
of colors from the main block of each layer, and the intervals in layer Li are 
of the form [bi^, bij 2 — 1] for some j i and j 2 - 

Proof. Let F be an optimum solution, and let F be the result of an (e, D)- 
augmentation on F. By Lemma 7a, fa(e) < (1 + (D + l))/<s(e) for every e. 

If /^(e) > 2x(e) /e for a non-leaf edge e of type j, then modify F(e) to be the 
the first x(e) colors of zone Zj. By Lemma 7b, Z 3 contains at least x(e) colors 
not greater than |~(l + e)(l-|-(.D-|-l)e)/e-x(e)"| < 2(l + (_D + l)e)x(e)/e. Therefore 
the x(e) colors assigned to e are not greater than 2(1 + (D + l)e)x(e)/e, hence 
fy( e ) < (1 + (D + 1 ))/<g(e). 

If F(e ) contains colors in the main zone below ex(e)/4, then delete these 
colors and let F(e) contain instead the first ex(e)/4 colors from zone Zj. There 
are at least ex(e)/4 colors in Z t below [(1 + e)(l + (D + l)e)/e • ex(e)/4] < 
(1 + (D + l))x(e) < (1 + (£> + 1 ))/<p(e). Therefore fy(e) < (1 + (D + 1 ))f<t>(e) 
for each edge e, and F is an (1 + (D + l)e)-approximate solution satisfying the 
first three properties of the lemma. 

Finally, we make F satisfy the fourth requirement as well. For each non-leaf 
edge e, let £j(e) be | F(e) D Lj| rounded down to the next integer multiple of 
\e 2 Q\/d\. If we use x, as a demand function on the non-leaf edges of the tree, 
then there is multicoloring satisfying Xj that uses at most \Li\ colors: F(e) n Li 
is such a coloring. Therefore by Theorem 6, there is a multicoloring Fi that 
uses at most |Lj| colors, satisfies Xj, and where each F(e) consists of at most 
two intervals of the form [1 + ji |"e 2 Q,-/4] , j 2 |"e 2 Q,-/4]] for some j\, j 2 - Modify 
coloring F: let F t determine how the colors are assigned in the main zone of layer 
i. Now the third requirement is satisfied, but it is possible that F assigns less 
than x(e) colors to an edge. We can lose at most \e 2 Q\/A\ — 1 < e 2 Q\/A colors 
in layer i, hence we lose at most the e 2 /4 part of each layer. Since F(e) contains 
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colors only up to 2x{e)/e, thus we lose only at most e 2 /4 • 2x{e)/e = ex(e)/2 
colors. If non-leaf edge e is of type j, then we use extra zone Zj to replace 
the lost colors. So far, edge e uses at most ex(e)/4 colors from Zj (previous 
paragraph), hence there are still place for more than 3ex(e)/4 colors in Zj below 
(1 + (D + l)e)x(e) < (1 + ( D + l)e)/<p(e). 

The modification in the previous paragraph can change the finish times of 
the non-leaf edges, but the largest color of each edge remains in the same layer, 
hence the finish time can increase by at most a factor of 1 + e. Moreover, since 
we modified only the non-leaf edges, there can be conflicts between the non-leaf 
and the leaf edges. But that problem is easy to solve: since the number of colors 
used by the non-leaf edges at vertex v from layer i does not increase, there are 
enough colors in layer i for the leaf edges. The largest color of each leaf edge will 
be in the same layer, hence its finish time increases by at most a factor of 1 + e, 
and f&(e) < (1 + e)(l + (D + l)e)f<p(e) follows for every edge e. □ 

Call a coloring satisfying the requirements of Lemma 8 a standard coloring. 
Notice that on a non-leaf edge e only a constant number of different color sets 
can appear in standard colorings: the main zone is not empty only in a constant 
number of layers, and in each layer the (at most two) intervals can be placed in a 
constant number of different ways. More precisely, in a standard coloring edge e 
can use the main zone only from layer log 1+e ex(e)/4 to layer log 1+e 2x(e)/e, that 
is, only in log 1+£ ((2x(e)/e)/(ex(e)/4)) = log 1+e 8/e 2 = 0(l/e • log 1/e) layers. In 
each layer, the end points of the intervals can take only at most 4/e 2 different 
values, hence there are (4/e 2 ) 2 different possibilities for each of the two intervals. 
Therefore if we denote by C e the different color sets that can appear in a standard 
coloring on non-leaf edge e, then \C e \ = ((4/e 2 ) 4 )°^ 1 / c ^ los = 2 c ’^ 1 / e ^' log 

Theorem 9. If every edge ofT(V,E ) has at most D non-leaf child edges , then 
for every eo > 0, there is a 2°( D / e °' log (-%£>)) . n fj me algorithm that gives an 
(1 + eo) -approximate solution to the seme problem. 

Proof. Set e := e 0 /(2 D + 3). We use dynamic programming to find the best 
standard coloring: for every non-leaf edge e, and every set S € C ei we determine 
OPT(e, S), which is defined to be the sum of the best standard coloring of T e , 
with the additional requirement that edge e receives color set S. Clearly, if all 
the values {OPT(r, S) \ S G C r } are determined for the root edge r of T, then 
the minimum of these values is the sum of the best standard coloring, which is 
by Lemma 8 at most (l + e)(l + (H-|-l)e) < (1 + cq) times the minimum sum. 

The values OPT(e, S) are calculated in a bottom up traversal of the edges. 
Assume that e has k non-leaf child edges ei,e 2 ,...,efc and I leaf child edges 
e\, e' 2 , ...,e' t When OPT(e, S) is determined, the values OPT(ej, Sf) are already 
available for every 1 < i < k and S) G C e . In a standard coloring of T e every edge 
e, is assigned a color set from C e . We enumerate all the n»=i |C e | possibilities 
for these color sets. For each combination 5i G C ei , . . . , 5/ G C e , we check 
whether these sets are pairwise disjoint. If so, then we determine the minimum 
sum that a standard coloring can have with these assignments. The minimum 
sum of subtree T e with color set S) on e, is given by OPT(e,, S)). The finish 
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time of edge e can be calculated from S. Now only the leaf edges e[, . .., e' e 
remain to be colored. It is easy to see that the best thing to do is to order these 
leaf edges by increasing demand size, and color them one after the other, using 
the colors not already assigned to e, e\, . . . , e*,. Therefore we can calculate the 
minimum sum corresponding to a choice of color sets S± € C ei Sk € C e , 
and we set OPT(e, S) to the minimum over all the combinations. 

The algorithm solves at most Sees IG| = n ' 2°^ 1 / e )' log2 subproblems. 
To solve a subproblem, at most 2 0 ( D '( 1 / e )' log !/ e ) different combinations of the 
sets Si,. . . ,Sk have to be considered. Each color set can be described by 0(l/e- 
logl/e) intervals, and the time required to handle each combination is polyno- 
mial in D and the number of intervals. Therefore the total running time of the 
algorithm is 2°( i>1 A-i°g 2 (iA)) . n = 2°^ Ao-iog 2 (r>Ao)) . n . □ 



4 The General Case 

In this section, we prove that seme admits a PTAS for arbitrary trees: 

Theorem 10. For every eo > 0, there is a 2°( 1 / e o log • n time algorithm 

that gives an eo-approximate solution to the seme problem for every tree T and 
demand function x. 

Proof. Let e := eo/72. The algorithm consists of a series of phases. The last phase 
produces a proper coloring of (T, x), and has cost at most (1 + eo)OPT(T, x). In 
the following we describe these phases. 

Phase 1: Rounding the Demands. Using Lemma 5, we can assume that 
x(e) is qi for some i, modifying x(e ) this way increases the minimum sum by at 
most a factor of 1 + e. An edge e with demand qi will be called a class i edge (if 
x(e ) = qi for more than one i, then take the smallest i). 

Phase 2: Partitioning the Tree. The edges of the tree are partitioned into 
subtrees in such a way that in a subtree the number of non-leaf child edges of a 
node is bounded by a constant. Now Theorem 9 can be used to find an approxi- 
mate coloring for each subtree. These colorings can be merged into a coloring of 
the whole tree, but this coloring will not be necessarily a proper coloring, since 
there might be conflicts between edges that were in different subtrees. However, 
using a series of transformations, these conflicts will be resolved with only a 
small increase of the sum. 

To obtain this partition, the edges of the tree are divided into large edges and 
split edges. It will be done in such a way that every node has at most D := 4/e 5 
large child edges. If a node has less than D children, then its child edges are large 
edges. Let v be a node with at least D children, and denote by n{v, i) the number 
of class i child edges of v. Let N(v) be the largest i such that n(v, i) > 0 and set 
F := 4/e 3 . Let e be a class i child edge of v. If n(v, i) > F, then e is a split edge, 
and it will be called a frequent edge. If n(v,i) < F and i < N(v) — [_l/e 2 J, then 
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e is a split edge, and it will be a called a small edge. Otherwise, if n(v, i) < F 
and i > N(v) — \l/e 1 \, then e is a large edge. Clearly, v can have at most 
F ■ Ll/ e2 J = 4/e 3 • [1 / e 2 J < 4/e 5 = D large child edges: for each class N(v), 
N(v) — 1, . . . , N(v) — Ll/e 2 J +1) there are at most F such edges. 

The tree is split at the lower node of every split edge, the connected compo- 
nents of the resulting forest form the classes of the partition. Defined in another 
way: delete every split edge, make the connected components of the remaining 
graph the classes of the partition, and put every split edge into the class where 
its upper node belongs. Clearly, every split edge becomes a leaf edge in its sub- 
tree, thus if every node has at most D large child edges in the tree, then in every 
subtree every node has at most D non-leaf child edges. 

Now assume that each subtree is colored with the algorithm of Theorem 9, 
this step can be done in 2°( D A- lo s (Pfc)) . n = 2°( 16 / € °' 1 / e ' los ( 4 A 6 )) • n = 

2 0(l/€ 11 -log 2 (l/e) ) 

• n time. Each coloring is an (1 + e)-approximate coloring of 
the given subtree, thus merging these colorings yields a (not necessarily proper) 
coloring of T such that f\p 1 (T) < (1 + e)OPT(T,x). In the rest of the proof, 
we transform into a proper coloring in such a way that the sum of the coloring 
does not increase too much. 

Phase 3: Small Edges. Consider the (e, t)- augment at ion of the coloring F\ 
with l \= 6. This results in a coloring F 2 such that fa 2 (G) < l + e£) (see 

Section 3). First we modify F 2 in such a way that the small edges use only the 
extra zones Z\ and Z 2 . More precisely, if a small edge e has parity r € {1,2}, 
then e is recolored using the colors in Z r (recall that the parity of the edge is 
the parity of its upper node). Since the extra zones contain only a very small 
fraction of the color spectrum, the recoloring can significantly increase the finish 
time of the small edges, roughly by a factor of 1/e. However, we show that the 
total demand of the small edges are so small compared to the largest demand 
on the child edges of v, that their total finish time will be negligible, even after 
this large increase. By definition, the largest child edge of v has demand q^tv)- 
Consider the small edges whose upper node is v, a node with parity r. Color 
these edges one after the other, in the order of increasing demand size, using only 
the colors in Z r . Call the resulting coloring $3. Let R v be the set of small child 
edges of v. We claim that j\p 3 ( R v ) < eqN( v ) for every node v, thus transforming 
F 2 into Fi increases the total sum by at most )T/, gT /o> 3 (A,;) < e H2veT Qn(v) < 
efy 2 (T) and f& 3 (T) < (1 + e)/W 2 (T) follows. To give an upper bound on f# 3 (R v ), 
we assume the worst case, that is, n(v,i) = F for every i < N(v) — [_1 / e 2 J . 
Imagine first that the small edges are colored using the full color spectrum, not 
only with the colors of zone Z r . Assume that the small edges are colored in the 
order of increasing demand size, and consider a class k edge e. In the coloring, 
only edges of class not greater than k are colored before e. Hence the finish time 
of e is at most 

k k 

E i)q i <FJ2^ + tY < 4 (4 + e)/e 4 ■ (1 + e) k 

i = 0 2=0 

< 5/e 4 • [(1 + e) fc J = 5/e 4 • q k . 
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That is, the finish time of an edge is at most 5/e 4 times its demand. Therefore 
the total finish time of the small edges is at most 5/e 4 times the total demand, 
which is 



JV(„)-Ll/e 2 j iV(„)-Ll/ e 2 j 

-4 n{v,i)qi<^ ^2 (1 + e)® 

6 i=0 6 i=0 

< |^(1 + e) Jv ^ ) -L 1/e2 J < ^(l + e^HiA 2 

< p • 2 _1 / e • (1 + e) s (”) < j ■ ^(1 + e) N(v) 



(In the fourth inequality we use (l + e) 1 / 6 > 2, in the fifth inequality it is assumed 
that e is sufficiently small that 2 1 / e > 44/e 10 holds.) However, the small edges 
do not use the full color spectrum, only the colors in zone Z r . By Lemma 7b, 
zone Z r contains at least c colors up to [(1 + e£)(l + e)/e • c] < 2/e • c, thus 
every finish time in the calculation above should be multiplied by at most 2/e. 
Therefore the sum of the small edges is 

e 2 

/<? 2 (Rv) fs 2/e • — • qN(v) < £Qn(v), 



as claimed. 



Phase 4: Shifting the Frequent Edges. Now we have a coloring t/A, that 
is still not a proper coloring, but conflicts appear only between some frequent 
edges and their child edges. First we ensure that every frequent edge e uses only 
colors greater than 2x(e)/e. After that, the conflicts are resolved using a set of 
so far unused colors, the colors in extra zones Z$ and Zq. 

Let M v be the set of frequent child edges of v, and let A v = |J egM ^ h { v ) be 
the colors used by the frequent child edges of node v. We recolor the edges in M v 
using only the colors in A v . Let ei,e 2 , . . . ,e\M \ be an ordering of the edges in 
M v by increasing demand size. Recall that the algorithm in Theorem 9 assigned 
the colors to the split edges (leaf edges) in increasing order of demand size, thus 
it can be assumed that frequent edge e\ uses the first x{e{) colors in A v , edge 
e 2 uses the a;(e 2 ) colors after that, etc. Denote by t(c) = \{e £ M v \fo 3 (e) > c}\ 
the number of edges whose finish time is at least c, and denote by t(c, i) = \{e £ 
M v \ /j/ 3 (e) > c, x(e) = qi}\ the number of class i edges among them. Clearly, 
t( c ) = YliL 0 *) bolds. Moreover, it can be easily verified that the total finish 

time of the edges in M v can be expressed as fa 3 (M v ) = t{ c )- 

The first step is to produce a coloring iq where every frequent edge e has only 
x(e)/(l + e) colors, but these colors are all greater than 2x(e)/e. The demand 
function is split into two parts: x(e) = Xi(e) + 22 (e), where 21 (e) is 2(e)/(l + 
e) and 22 (e) is e 2 (e)/(l + e), rounding problems are ignored in this extended 
abstract. 
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This phase of the algorithm produces a coloring 1F4 of M v that assigns only 
X\{e) colors to every edge e £ M v , but satisfies the condition that it uses only 
the colors in A v , and every edge e receives only colors greater than 2x(e)/e. In 
the next phase we will extend this coloring using the colors in zones and Z/±. 
every edge e will receive an additional X2 (e) colors. 

Coloring 'P 4 is defined as follows. Consider the edges ei,. ..,ei M | in this 
order, and assign to ek the first x\ (ek) colors in A v greater than 2 x(ek)/e and 
not already assigned to an edge ej ( j < k). Notice the following property of 
<^4: if j < k, then every color in 1^4 (ej) is less than every color in ^(e*,). This 
follows from 2 x(ej)/e < 2 x(ek)/e: every color usable for ek is also usable for ej 
if j < k. Define t'(c) = \{e £ M v |/^ 4 (e) > c}| and t'(c,i) = \{e £ M„|/^ 4 (e) > 
c, x(e) = Qi } | as before, but now using the coloring $4. We claim that t'(c,i ) < 
(1 + e)t(c,i) holds for every c > l,i > 0. If this is true, then t'(c ) < (1 + e)t(c) 
and ,/V 4 (M„) < (1 + e)f# 3 (M v ) follow from fa 4 (M v ) = Summing this 

for every node v gives fip 4 (T) < (1 + e)fy 3 (T). 

First we show that t'(c , i) < t(c, i) + 3/e. Denote by A c = \A V D [1, c — 1] | the 
number of colors in A v available below c. If every class i edge has finish time at 
least c in <^3, then t(c,i) — n(c,i ) > t'{c,i) and we are ready. Therefore there is 
at least one class i edge that has finish time less than c. This implies that the 
frequent edges of class 0, 1, 1 use only colors less than c. Denote by X the 

total demand of these edges (in the demand function x(e)), the class i frequent 
edges use at most A c — X colors below c. 

Now recall the way #4 was defined, and consider the step when every edge 
with class less than i is already colored. At this point at most X colors of A c 
are used (possibly less, since $4 assigns only aq(e) colors to every edge e, and 
only colors above 2x(e)/e). Therefore at least A c — X colors are still unused in 
Ay below c. From these colors at least A c — X — \2q i /e'\ of them are above 2 ®/e. 
Thus Sq can color at least (A c — X — \2qi/e\)/qi > (A c — X)/qi — 3/e edges of 
class i using only colors below c. However, 1F3 uses A c — X colors below c for the 
class i edges, hence it can color at most (A c — X)/qi such edges below c, and 
t'(c, i ) < t(c, i) + 3/e follows. 

We consider two cases. If t(c,i ) > 3/e 2 , then t'(c,i ) < t(c,i) + 3/e < (1 + 
e)t(c,i), and we are ready. Let us assume therefore that t(c,i) < 3/e 2 , it will 
turn out that in this case = 0. There are n(v,i ) — t(c,i) > n(v,i ) — 3/e 2 

class i edges that has finish time less than c in 1F3. Therefore, as in the previous 
paragraph, before $4 starts coloring the class i edges, there are at least (n(v, i ) — 
3/e 2 ) • qi unused colors less than c in A v . The total demand of the class i edges 
in xi(e) is at most n(e,i)qi/( 1 + e). The following calculation shows that the 
unused colors below c in A v is sufficient to satisfy all these edges, thus tfq assigns 
to these edges only colors less than c. We have to skip the colors not greater 
than 2 qi/e, these colors cannot be assigned to the edges of class i, which means 
that the number of usable colors is at least 

(n{v, i) - 3/e 2 ) • - 2 qje > (n(v, i) - — *— • 4/e 2 ) ■ q t + 1 

1 + e 

> (1 - — —)n(v,i)qi + 1 > n(e,i)qi/( 1 + e), 
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since n(v,i ) > 4/e 3 by the definition of the frequent edges. Therefore *lq assigns 
to the class i edges only colors less than c, hence t(c,i) = 0, as claimed. 

Phase 5: Full Demand for the Frequent Edges. The next step is to modify 
$q such that every frequent edge receives x(e) colors, not only 2i(e). Coloring 

is obtained from <Zq by assigning to every frequent edge e an additional 22(e) 
colors from zone Z3 or Z4. More precisely, let v be a node with parity r, and 
let ei, . . . ,e\M \ be its frequent child edges, ordered in increasing demand size, 
as before. Assign to ei the first 22 (ei) colors from Z 2+r , to e2 the first 22(62) 
colors from Z 2+r not used by ei, etc. It is clear that no conflict arises with the 
assignment of these colors. 

We claim that these additional colors increase the total finish time of the 
frequent edges at v by at most a factor of (1 + (£ + l)e). Let 2* = Sj=i x ( e j) 
be the total demand of the first i edges. The finish time of e^ in iq is clearly 
at least 2*, since 'Jq colors every edge ej with j < i before e,;. On the other 
hand, by Lemma 7b, zone Z 2+r contains at least 3/^2* colors not greater than 
\(1 + e£)x*]. These colors are sufficient to satisfy the additional demand of the 
first i edges. 



Phase 6: Resolving the Conflicts. Now we have a coloring iq such that there 
are conflicts only between frequent edges and their child edges. Furthermore, if 
e is a frequent edge, then iq(e) contains only colors greater than 2x(e)/e from 
the main zone. It is clear from the construction of tZq that only the colors in the 
main zone can conflict. 

Let e be a frequent edge that conflicts with some of its children. Let the child 
edges of e have parity r. There are at most 2(e) colors that are used by both 
e and a child of e. We resolve this conflict by recoloring the child edges of e in 
such a way that they use the first at most 2(e) colors in zone -ZZ+r instead of 
the colors in 'iq(e). It is clear that if this operation is applied for every frequent 
edge e, then the resulting color is a proper coloring. 

Notice that if a child edge e! of e is recolored, then it has finish time at least 
[22(e) /e] , otherwise it does not conflict with e. On the other hand, by Lemma 7b, 
zone Z4 +r contains at least 2(e) colors up to [(l + e£)-(l + e)2(e)/e"| < [22(e)/e"|, 
thus the recoloring does not add colors above that. Therefore the finish time of 
e! is not increased, since fy 6 (e') > [22’(e)/e]. 



Analysis. If we follow how the sum of the coloring changes during the previous 
steps, it turns out that iF 6 is an eo-approximate solution to the instance (T,xq). 

The running time of the algorithm is dominated by the coloring of the 
low-degree components with the algorithm of Theorem 9. This phase requires 
20(i6/e “-l/c-iog ( 4 /e 6 )) . n ^ _ 2 °( 1 / e o lo s (i/eo)) . n tj me . The other parts of the 

algorithm can be done in time linear in the size of the input. Therefore the total 
running time is 2 c ^ 1 / e ° log ■ n, which completes the proof of Theorem 10. 

□ 
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Abstract. We consider the scheduling problem of minimizing the aver- 
age weighted completion time on identical parallel machines when jobs 
are arriving over time. For both the preemptive and the nonpreemptive 
setting, we show that straightforward extensions of Smith’s ratio rule 
yield smaller competitive ratios compared to the previously best-known 
deterministic on-line algorithms, which are (4 + ej-competitive in either 
case. Our preemptive algorithm is 2-competitive, which actually meets 
the competitive ratio of the currently best randomized on-line algorithm 
for this scenario. Our nonpreemptive algorithm has a competitive ratio 
of 3.28. Both results are characterized by a surprisingly simple analysis; 
moreover, the preemptive algorithm also works in the less clairvoyant 
environment in which only the ratio of weight to processing time of a 
job becomes known at its release date, but neither its actual weight nor 
its processing time. In the corresponding nonpreemptive situation, every 
on-line algorithm has an unbounded competitive ratio. 



1 Introduction 

Model. We consider the problem of scheduling jobs arriving over time on-line on 
identical parallel machines to minimize the sum of weighted completion times. 
Each of the in machines can process only one of the n jobs at a time. Each 
job j of a given instance has a positive processing time pj > 0 and a nonneg- 
ative weight Wj > 0. We learn about these job data only at the job’s release 
date Tj > 0, which is not known in advance, either. If C 3 denotes the comple- 
tion time of job j in a feasible schedule, the corresponding objective function 
value is y^”_ 1 WjCj. We consider both the preemptive and the nonpreemptive 
machine environments. In the preemptive setting, the processing of a job may 
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be suspended and resumed later on any machine at no extra cost. In contrast, 
once a job is started in the nonpreemptive mode, it must be processed on the 
same machine without any interruption until its completion. In scheduling nota- 
tion [6], the corresponding off-line problems are denoted by P | r.j . pmtn | WjCj 
and P | r : j | ^ WjCj, respectively. Already the analogous single-machine problems 
are NP-hard [9, 10]. 

However, instances of 1 1 | ^ WjCj are optimally solved by Smith’s Weighted 
Shortest Processing Time (WSPT) rule, which sequences jobs in nonincreasing 
order of their weight-to-processing-time ratios [17]. For convenience, we assume 
that the jobs are indexed in this order so that w\/p\ > W 2 /P 2 > • • • > w n /p n . 
Moreover, we say that a job with a smaller index has higher priority than one 
with a larger index. 

The quality of on-line algorithms is typically assessed by their worst-case 
performance, expressed as the competitive ratio [16]. A p-competitive algorithm 
provides for any instance a solution with an objective function value of at most 
p times the value of an optimal off-line solution. 



Main Results. We show in Section 2 that a natural extension of the WSPT rule 
to preemptive scheduling on identical parallel machines with release dates is 2- 
competitive, and this bound is tight. The idea is to interrupt currently active 
jobs of lower priority whenever new high-priority jobs arrive and not enough 
machines are available to accommodate the arrivals. 

When preemption is not allowed, a straightforward extension of this scheme 
is to start the currently available job of highest priority whenever a machine 
becomes idle. However, this rule does not directly lead to a bounded competitive 
ratio. In fact, consider a single-machine instance in which a job of high priority 
is released right after the start of a long lower-priority job. Therefore, we first 
modify the release date of each job such that it is equal to a certain fraction of 
its processing time, if necessary. If we now start a long job j and a high-priority 
job becomes available shortly thereafter, the ill-timed choice of starting j can 
be accounted for by the fact that the high-priority job has a release date or 
processing time at least as large as a fraction of pj . Therefore, its delay is bounded 
by its own contribution to the objective function in any feasible schedule. We 
consider a family of alike algorithms in Section 3 and show that the best one is 
3.28-competitive. In this case, we cannot show that our analysis is tight, but the 
remaining gap is at most 0.5. 

Related Work. Lu, Sitters and Stougie [11] introduced a related class of 2- 
competitive algorithms, which use similar waiting strategies for the on-line vari- 
ant of the single- machine problem 1 1 r 7 - 1 ^ Cj . In fact, the idea of boosting 
release dates was used before by Hoogeveen and Vestjens [8] and Stougie (cited 
in [18]), who delayed the release date of each job j until time max{rj,Pj} and 
Tj + pj, respectively. Anderson and Potts [2] extended both, Hoogeveen and 
Vestjen’s algorithm and its competitive ratio of 2, to the setting of arbitrary 
nonnegative job weights. These results are best possible since Hoogeveen and 
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Vestjens also proved that no nonpreemptive deterministic on-line algorithm can 
achieve a competitive ratio better than 2. 

Phillips, Stein and Wein [12] presented another on-line algorithm for 
1 1 r j | 22 Cj , which converts a preemptive schedule into a nonpreemptive one 
of objective function value at most twice that of the preemptive schedule. Since 
Sclrrage’s Shortest Remaining Processing Time (SRPT) rule [13] works on-line 
and produces an optimal preemptive schedule for the single-machine problem, 
it follows that Phillips, Stein and Wein’s algorithm has competitive ratio 2 as 
well. The conversion factor is 3 — 1/m if applied to identical parallel machines, 
but the corresponding preemptive problem is NP-lrard in this case. However, 
Clrekuri, Motwani, Natarajan and Stein [3] noted that by sequencing jobs non- 
preemptively in the order of their completion times in the optimal preemptive 
schedule on a single machine of speed m times as fast as that of any one of the 
m parallel machines, one obtains a (3 — 1 /m)-competitive algorithm for the on- 
line version of P | rj \ 22 Cj. For the same problem, Lu, Sitters and Stougie [11] 
gave a 2a-competitive algorithm, where a is the competitive ratio of the direct 
extension of the SRPT rule to identical parallel machines. Phillips, Stein and 
Wein [12] showed that this rule is 2-competitive, but a smaller value of a has 
not been ruled out. 

In any case, the hitherto best known deterministic on-line result for the corre- 
sponding scheduling problems with arbitrary job weights, P | r ? , pmtn | 22 w jCj 
and P | r j | 22 w jCj was a (4 + e)-competitive algorithm by Hall, Schulz, Shmoys 
and Wein [7], which was given as part of a more general on-line framework. 
For 1 1 rj, pmtn | 22 lUj Cj , Goemans, Wein and Williamson (cited as personal 
communication in [14]) noted that the preemptive version of the WSPT rule is 
2-competitive; it schedules at any point in time the lriglrest-priority job, possi- 
bly preempting jobs of lower priority. (A proof of this result is given in [14].) 
Our preemptive parallel-machine algorithm is the direct extension of this varia- 
tion of Smith’s rule. Schulz and Skutella [14] and Goemans, Queyranne, Schulz, 
Skutella and Wang [5] give comprehensive reviews of the development of on- 
line algorithms for the preemptive and nonpreemptive single-machine problems, 
respectively; Hall, Schulz, Shmoys and Wein [7] do the same for the parallel 
machine counterparts. 

On the side of negative results, Vestjens [18] proved a universal lower bound 
of 1.309 for the competitive ratio of any deterministic on-line algorithm for 
P | r j | 22 Cj- In the preemptive case, the currently known lower bound is just 
22/21, also given by Vestjens. 

Let us eventually mention that the currently best randomized on-line algo- 
rithms for the two problems considered here have (expected) competitive ratio 2; 
see [15]. Moreover, the off-line versions of these problems are well understood; 
in fact, both problems have a polynomial-time approximation scheme [1]. 

2 Preemptive Parallel Machine Scheduling 

We consider the following extension of the single-machine preemptive WSPT 
rule to identical parallel machines. 
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Algorithm P-WSPT 

At any point in time, schedule the m jobs with the highest priorities 
among the available, not yet completed jobs (or fewer if less than m 
incomplete jobs are available). Interrupt the processing of currently 
active jobs, if necessary. 

The algorithm works on-line since the decision about which job to run at any 
given point in time t is just based on the set of available jobs at time t. In fact, 
it only depends on the priorities of the available jobs. In particular, Algorithm 
P-WSPT also operates in an environment in which actual job weights and 
processing times may not become available before the completion of the jobs, as 
long as the jobs’ priorities are known at their respective release dates. 

Theorem 1. The Algorithm P-WSPT produces a solution of objective function 
value at most twice the optimal value for the off-line problem P\rj,pmtn\ WjCj. 

Proof. Consider the time interval ( rj,Cj ] of an arbitrary but fixed job j. We 
partition this interval into two disjunctive sets of subintervals: I(j) contains 
the subintervals in which job j is being processed, and I{j) denotes the set of 
remaining subintervals, in which other jobs than j are being processed. Note 
that no machine can be idle during the subintervals belonging to /(j). Since the 
algorithm processes job j after its release date rj whenever a machine is idle, we 
obtain 

Cj < rj + i/(j)i + \m , 

where | • | denotes the sum of the lengths of the subintervals in the corresponding 
set. 

The overall length of I (j) is clearly pj. Only jobs with a higher ratio of 
weight to processing time than j can be processed during the intervals of the 
set I(j), because the algorithm gives priority to j before scheduling jobs with 
lower ratio. In the worst case, that is when |/(j)| is maximal, all jobs with 
higher priority than j are being processed in the subintervals of this set. Then 
\I(j)\ = (J2k<jPk)/ m > and we can bound the value of the P-WSPT schedule 
as follows: 

Y w i C i ^ Y + p i) + Y w i Y ~ • 

j j j k<j 

Since the completion time Cj of a job j is always at least as large as its 
release date plus its processing time, JT Wj(rj +Pj) is obviously a lower bound 
on the value of an optimal schedule. Moreover, JT Wj Y^,k<j Pk/ m is the objective 
function value of an optimal solution to the corresponding instance of the relaxed 
problem 1 1 1 'f2 w jCj on a single machine with speed m times the speed of any 
of the identical parallel machines. As this problem is indeed a relaxation of 
the scheduling problem considered here, we can conclude that the P-WSPT 
algorithm is 2-competitive. □ 

A family of instances provided by Schulz and Skutella [14] shows that this 
result cannot be improved. In fact, for m = 1, P-WSPT coincides with the 
preemptive single-machine algorithm studied in their paper. Taking m copies of 
Schulz and Skutella’ s instance yields the following result. 
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Lemma 2. The competitive ratio of the Algorithm P-WSPT is not better than 2 
for the on-line problem P | rj , pmtn | w jCj > f or an y given number of machines. 

Proof. We include a proof for the sake of completeness. We consider an instance 
that is slightly different from the one given in [14]. It consists of m copies of 
n + 1 jobs with Wj = 1, pj = n — j/n and rj = jn — j(j + l)/(2 n) for all 
0 < j < n. Algorithm P-WSPT preempts any job when it has left just 1/n units 
of processing time and finishes it only after all jobs with a larger release date 
have been completed. The value of this schedule is m ■ (^j=o( r ™ + .Pn + j/n)). 

An optimal off-line algorithm does not preempt any job and yields a schedule 
of value m- (EJ=o( r ! + Pl +i/ n ))- A simple calculation shows that the ratio of 
the values of the P-WSPT schedule and the optimal schedule goes to 2 when n 
goes to infinity. □ 

Of course, Theorem 1 subsumes the scheduling problem P | r ? , pmtn | Cj as 
a special case. Thus, this extension of the 2-competitive single-machine Shortest 
Processing Time (SPT) rule to the parallel machine setting has the same com- 
petitive ratio as the analogous extension of Sclrrage’s optimal single-machine 
SRPT rule [12], 



3 Nonpreemptive Parallel Machine Scheduling 



Every reasonable on-line algorithm for nonpreemptive scheduling has to make use 
of some kind of waiting strategy. We refer the reader to [18, Chapter 2] and [11] 
for comprehensive discussions of related techniques for the single machine. Here, 
we extend the idea of delaying release dates to the parallel machine problem. 

Algorithm Shifted WSPT 

Modify the release date of every job j to r' , where r' is some value 
between ma x{rj,a Pj} and rj + a Pj, for some a £ (0, 1]. Whenever 
a machine becomes idle, choose among the available jobs a job j with 
highest priority and schedule it on the idle machine. 



Note that this is indeed an on-line algorithm; we will later choose a so that we 
minimize the corresponding competitive ratio. Moreover, for m = 1 and a = 1, 
Algorithm Shifted WSPT is identical to the algorithm proposed in [11] for 

1 \ r j I E Cl- 
in the analysis of the Shifted WSPT algorithm, we make use of the follow- 
ing lower bound on the optimal value of the relaxed problem with trivial release 
dates, which is due to Eastman, Even and Isaacs [4]. 

Lemma 3. The value of an optimal schedule for an instance of the scheduling 
problem P\ \ 'Yh w jCj is bounded from below by 



71 71 

E \ ' Pk m — 1 \ - 

1=1 k<j 1=1 



2 m 
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Let us now analyze the performance of the Shifted WSPT algorithm. 

Theorem 4. The Algorithm Shifted WSPT has a competitive ratio of less 
than 2 + max{l/a, a + TffY } for the on-line problem P | rj | Y w jCj ■ 

Proof. The algorithm schedules a job j at time r' if a machine is idle and j is the 
job with the highest ratio of weight to processing time among all available jobs; 
otherwise, j has to wait for some time. The waiting time for j after r' is caused 
by two types of jobs: jobs with lower priority but which started before time r' , 
and jobs with higher priority. Let L{j) denote the set of lower-priority jobs £ > j 
with r' t < r'j and Cf > r', and let H(j) denote the set of higher-priority jobs 
h < j that postpone the start of j. Then, 

Cj < r'j + W(L(j )) + W(H(j)) + Pj , ( 1 ) 

where W(L(j)) and W{H{j)) denote the waiting time for j caused by jobs in 
the sets H(j) and L(j), respectively. Note that the algorithm does not insert idle 
time on any machine in the time interval between r' and the start of job j. Let 
us analyze the maximal waiting time that can be caused by jobs in the set L(j) 
and in the set H(j), separately. 



1. Jobs in L(j). Each machine has at most one job £ £ L(j). By definition, 
every one of these jobs satisfies a.pi<r\< r’j. Thus, pe < r'j /a. 

2. Jobs in H(j). The maximal waiting time for job j arises if all jobs h < j are 
processed after the completion of jobs in the set L(j) and before the start of 
job j. If all jobs in H(j) are simultaneously processed on m machines, then 
at least one machine finishes processing jobs in H(J) after a time period of 
length at most YheH(j)Ph/ m - Hence, the maximal waiting time for job j 
caused by jobs in H(j) is Yh<j Vh/m ■ 

Inequality (1) and the observations on the sets causing waiting time imply: 



Ci 



<a + -K- + E 



h<j 



Ph 

m 



Pj 



+'-)[<-, +^,)+'Yfv, 



EIl 

' m 

h<j 



m — 1 
2m 

Ph 



~Pj 



, r,- . m — 1 . . . , Ph to — 1 

= 1 a + <“ + 1ST' » > + {T > + P ’ ] + £ rn + TST P -’' 

h<j 



Thus, Algorithm Shifted WSPT generates a schedule of value 
E w jCj < (1 + max{ A , a + ^}) Wj(rj + Pj ) + Y w j( E jjt + Pj)- 

j j j h<j 



The proof is completed by applying two lower bounds on the optimal value: first, 
the trivial lower bound Yj w j( r j + Pj), and second, the lower bound presented 
in Lemma 3. □ 
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A simple calculation shows that the minimum of max{^, a+ is attained 
at a = (1 — m + yjl 6m 2 + (to — 1) 2 )/(4to) =: a m . In particular, a\ = 1. 

Corollary 5. The Algorithm Shifted WSPT with a = a m is (2 + l/a m )- 
competitive. The value of this increasing function ofm is 3 for the single-machine 
case and has its limit at (9 + VT7)/4 « 3.28 for m — I oo. 

Lemma 6. Algorithm Shifted WSPT cannot achieve a better competitive ra- 
tio than max{2+a, 1+ > 2+ '^ j ~ 1 for a £ (0, 1] , on any number of machines. 

Proof. We give two instances from which the lower bound follows. Consider 2m 
jobs released at time 0; half of the jobs are of type I and have unit processing 
times and weights e, whereas the other half of the jobs, type II, have processing 
time l+£ and unit weight. The algorithm modifies the release dates and schedules 
jobs of type I at time t = a first, one on each machine, followed by jobs of type II. 
The value of the schedule is m(a + l)s + m(a + 2 + e). In the optimal schedule, 
jobs of type II start processing first, at time t = 0, such that the value of the 
schedule is m(l + e) + m( 2 + e)s. For e — > 0, the ratio between the value of the 
Shifted WSPT schedule and the optimal schedule goes to 2 + a. 

The second instance consists again of 2 to jobs: half of the jobs are of type I 
and have release dates 1 + s, processing times £ and weights 1 /m, whereas the 
other half of the jobs, type II, are released at time 0 and have processing time 1/a 
and zero weight. Shifted WSPT starts scheduling jobs at time 1 and obtains 
a solution with a value of at least 1 + 1/a + e. The value of the optimal schedule 
is 1 + 2s. □ 

For the special choice a = a m , the first lower bound is tighter and it follows 
more concretely: 

Corollary 7. The Algorithm Shifted WSPT with a — a m is not better than 
(l + 7m+ yjl 6m 2 + (to — l) 2 )/(4?n) for instances with m machines. This means 
that our performance analysis has a gap of at most (to — l)/2 to < 0.5, and it is 
tight for the single-machine problem. 
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Abstract. In the extensible bin packing problem we are asked to pack 
a set of items into a given number of bins, each with an original size. 
However, the original bin sizes can be extended if necessary. The goal 
is to minimize the total size of the bins. We consider the problem with 
unequal (original) bin sizes and present the tight bound of a list schedul- 
ing algorithm for each collection of original bin sizes and each number of 
bins. We further give better on-line algorithms for the two-bin case and 
the three-bin case. Interestingly, it is shown that the on-line algorithms 
have better competitive ratios for unequal bins than for equal bins. Some 
variants of the problem are also discussed. 



1 Introduction 

We consider the following on-line extensible bin packing problem: there are m 
bins B i, i?2, . . . , B m with original bin sizes b\, 62, . . . , b m . The original bin sizes 
can be extended if needed. The (final) size of a bin is defined to be the larger 
one between the original bin size and the total size of the items assigned to it. 
Items {ai, a2, . . • , a n } arrive over list, which are not known in advance. When 
an item arrives it must immediately and irrevocably be assigned to one of 
the bins and the next item a^+i becomes known only after ai has been assigned. 
The size of item is pi- The goal is to assign all items to the bins such that the 
total size of the bins is minimized. 

This problem arises in a wide variety of contexts. It has many applications in 
bin packing, storage allocation and scheduling problems. Consider, for instance, 
a set of workers is given with regular working times. In case of needed, the worker 
can do some overtime works. The problem is to assign duties to the workers in 
such a way that the total work (regular working times plus overtime works) is 
minimized. 
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Competitive Ratios. An instance of the problem consists of a list of items 
and m bins with original sizes b\, 62 , . . . , b m . Let B be the collection of bin sizes, 
i.e. , B = {61, i>2, • • • , b m }. For any instance L, let A^m, B) and OPTi,(m, B) be 
the total size of the bins used by an on-line algorithm A and the total size of the 
bins used by an optimal off-line algorithm, respectively. Then the competitive 
ratio of algorithm A is 

, B) = sup Ai,(rn, B) / OPTi,(m , B). 

L 

If the bin sizes are identical, i.e., b\ = 62 = • • • = b m , the competitive ratio is 
denoted as Ra{iti). Further we define the overall competitive ratios as follows. 

Ra{iti,-) = sup Ra(iti, B), Ra = supii/^m, B). 

B m,B 

Known Results. The extensible bin packing problem with unequal bin sizes 
was first studied by Dell’01mo and Speranza [5]. They considered both the off- 
line case and the on-line case under the assumption that the maximum item 
size is not larger than the minimum bin size. They showed an upper bound of 
4 — 2\/2 for an LPT algorithm in the off-line case and an upper bound of 5/4 
for a list scheduling algorithm LS in the on-line case. The bound 5/4 is tight in 
terms of the overall competitive ratio, i.e., Rls = 5/4. 

The special case that all bin sizes are equal to one has been extensively stud- 
ied. It was proved that the off-line version of the problem is NP-hard in the 
strong sense [4]. Actually, it can be easily reduced from the 3-partition problem 
[7]. DeU’Olmo et al. [4] proved that the worst-case ratio of the LPT algorithm is 
13/12. If the number of the bins is fixed, it follows from the results of [11] that 
there exists a fully polynomial time approximation scheme. If m is not fixed, 
a polynomial time approximation scheme was provided in [1], Recently, Coff- 
man and Lueker [3] presented a fully polynomial time asymptotic approximation 
scheme. They also proved that the problem does not admit a fully polynomial 
time approximation scheme unless P = NP, when m is not fixed. The on-line 
version was studied by Speranza and Tuza [10]. They showed that the overall 
competitive ratio of a list scheduling heuristic is 5/4. Then a class of heuristics 
H x were presented which depend on a parameter x, 0 < x < 1, and tend to 
load partially loaded bins until a limit total load of 1 + x is reached on the bin. 
For any number of bins, the algorithm has an overall competitive ratio bounded 
above by 1.228. For the lower bound of the on-line problem, they showed that no 
heuristic can have a competitive ratio smaller than 7 /6 by presenting an instance 
for the case that m = 2. Ye and Zhang [12] considered on-line scheduling on a 
small number m of bins. They proved lower bounds for m = 3,4 and gave the 
competitive ratios Rh ( m ) for m = 2,3 and 4. For in = 2 the algorithm is best 
possible. An improved algorithm for m = 3 was also presented. 

A similar problem, called on-line bin stretching, was introduced by Azar and 
Regev [2]. A set of bins of fixed sizes is given. Items arrive on-line while it is 
known that there exists a valid packing of all items into the given bins. One is 
asked to pack all items into the bins, which can be overloaded. The goal function 
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is the stretch factor, which is the maximum ratio for any bin between the size to 
which any bin was stretched (the sum of all items assigned to it) and its original 
size. Azar and Regev [2] studied the problem of identical bins, i.e. all original 
bins are of size 1. Epstein [6] considered the two-bin case with unequal bin sizes. 

Our Results. In this paper, firstly we assume, as in [5], that the largest item 
size is at most the smallest bin size, i.e., 



maxp,; < min&j. (1) 

*= i 3 — 1 

We analyze the LS algorithm more carefully and prove the competitive ratios for 
each to and each collection B of bin sizes. The ratios differ for an even number 
of bins and an odd number of bins. It also shows that the competitive ratio of 
LS for the equal bins is exactly 5/4 when m is even and 5/4 — 1/(4 to 2 ) when m 
is odd. We then present an improved on-line algorithm for m = 2 and in = 3. 
Finally we discuss the problem without the assumption (1). A lower bound 6/5 
for the overall competitive ratio is presented. We prove the competitive ratio of 
LS for the two-machine case and present an improved on-line algorithm. 

Notations and Organization of the Paper. Consider any algorithm A. 
During the execution of algorithm A , if a bin Bj has a load (the total size of 
items assigned to it) is over its original size bj, Bj is called heavy; otherwise, it 
is called light. The difference between the load and the size bj is idle space if Bj 
is light or excess if Bj is heavy. 

Consider any instance L where < Sjli ty- In an optimal pack- 

ing there must be some bins with idle space. Construct a new instance L' 
by adding new items such that no bins have idle space in the optimal pack- 
ing without changing the optimal value. Note that A^(m.,B ) > Al(i71,B) and 
OPT L f(m.,B) = OPT L {m,B). Therefore, 

A l (to, B) /OPT l (to, B) < A L i (to, B)/OPT l > (m,B). 

In instance L' , the total size of items is at least the total original size of bins. It 
implies that we only need to consider the instances in which the total size of items 
is at least the total original size of bins to prove an upper bound. Throughout 
the paper in proving the competitive ratios we always assume 



^Pi>^bj. ( 2 ) 

*=1 3 = 1 

The remainder of the paper is organized as follows. Section 2 gives the tight 
bound for a list scheduling algorithm. In Section 3, we present an on-line al- 
gorithm for m = 2 and to = 3. We discuss the problem without the item size 
assumption (1) in Section 4. 
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2 Tight Bound for a List Scheduling Algorithm 



List Scheduling was introduced by Graham [8] for parallel machine scheduling. 
The algorithm always schedules the current job to the machine with minimum 
load at this time. Applying list scheduling to extensible bin packing with unequal 
bin sizes may result in different versions. We first consider a list scheduling 
algorithm, denoted by LS , which always assigns the incoming item to the bin of 
largest idle space. Dell’Olmo and Speranza [5] proved that Rls = 5/4. In this 
section we figure out -Rls(to, B). Then we will show that a different version of 
list scheduling, which always assigns the incoming item to the bin with the least 
load (the total size of items in a bin), is not as good as LS in terms of the overall 
competitive ratio. 

Let b m i n be the smallest bin sizes in the collection B and let p max be the 
largest item size. Under the assumption (1), we have p max < b m i n . Consider the 
packing given by algorithm LS. For each bin Bj, let lj be the total size of the 
items it accommodates after the schedule is over. If Bj is light, let Sj be its idle 
space, i.e., s :j = bj — lj. If Bj is heavy, let r'j be its idle space immediately before 
it becomes heavy, and ej be its excess, i.e., Cj = lj — bj. We re-index the bins so 
that the first k bins are heavy. 

In proving an upper bound for algorithm LS we can assume, without loss of 
generality, that a bin accepts no more items once it becomes heavy. Otherwise, 
all bins are heavy and the packing is optimal. 



Lemma 2.1. The competitive ratio of the list scheduling algorithm 



1 + 



mb 



R LS (m,B) > 



4 E =x b 



if m is even, 



1 + if m i s odd. 

4m 2_^ _ x b 



Proof. Consider the following instances. If m is even, a large number of small 
enough items are coming first. The total size of these items is Ej=i bj — mb m i n / 2 
such that after assigning these items by LS , each bin has idle space exactly 
bmin/ 2. Then m/2 items with size of b m i n are coming. Clearly, the optimal 
value is J^JLibj, while LS generates a total size Ejli bj + mb m j n / 4. Thus 
R L s(m,B)>l + ^— v . 

If m is odd, the instance is slightly different. A large number of small enough 
items are coming first. The total size of them is Efli bj — {jn — l)b m i n /2, so 
that after assigning these items each bin has idle space exactly b mln . Then 
(m — 1)/2 items with length 6 min are coming. Clearly, the optimal total bin size 
is Ejli bj , while LS generates a total bin size of J2jLi bj + (w 2 — l)&mm/(4rn). 

Thus Ri,s(jn,B) > 1 + - . The lemma is proved. □ 



Theorem 2.2. The competitive ratio of the list scheduling algorithm 



1 + 



mb 



if m is even, 



R LS (m,B) = \ (J/_i) 6 . f . , , 

1 + ^ r - ? !! m 1S 0( lcl. 

1 4m y j b 7 
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Proof. We only need to prove that 



1 + 



mb 



R L s{m.,B) < 



4 E =1 fc 



if to is even, 



1 + ) if m is odd. 

4m y _ _ i b 



Consider any instance L(m,B) satisfying (2). By the algorithm LS , we know 



that 



Then 



E k 
7 = 1 



i= i ' J 



> min r, > max s,- > 



E m 

j=k + 1 S 3 



i n maA. '-’7 _ 

j=fc+l,..-,m TO — k 



(to - fc) J2rj>k 



j — i i=fc+i 



From (2), we have 



E e ^ E ; 

i — 1 j=/c+l 



(3) 



(4) 



We add (to — fc) Ei=i e i t° both sides of the inequality (3) and from (4), we get 



<7=1 'O 

k m k 

(to. -k)Y ( r 3 + e j) > k E s i + ( m ~ k ) E e ! 

j=i i=fe+i i=i 

m m 

— k s 3 4" ( m — s j 

j—k -\- 1 j=fc-|-l 

m 

= TO E s i- 

j=fc+i 



It means that Ejlfc+i s j — Ej=i ( r j + e j)- Note that the competitive 

ratio 

n m n 

R L s(m,B) <(Y,Pi+ E S f)/E ft ' 

£=1 j=fc+l 2=1 



Then 



(m 

R-Lsiph B) < H 



fc ) EjLl (C + e j) ^ , ( m 

E n - ^ t 

i=iPi 



k ) E*=i + e i) 



Since + ej < p m ax < 6 m j n , we obtain 



R LS (m, B) < 1 + 



(to - k)kb min 

">E”,^ 
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If to is even, then to 2 — 4 mk + 4 k 2 = (m — 2k) 2 > 0 and (to — k)k/m 2 < 1/4. It 
follows 



R LS (m,B) < 1 + 



mb n 



4 £7=1 V 



If to is odd, then to 2 — 4 mk + 4 k 2 = ( m — 2k) 2 > 1 and (to — k)k < \{m 2 
We have 



R LS (m,B) < 1 + 



(?n 2 - 1 )bmin 

4mEr=i^' ' 



!)• 



□ 



Corollary 2.3. 

R LS (m,B) < R L s(m) 



if to. is even, 
f - 4 ^ 2 , if m is odd. 



Moreover, Rls = 5/4. 



□ 



Note that J?ls(to, B) < RLs{m) if to > 2 and 6 , ^ bj for some i,j. It means 
that the competitive ratio becomes smaller if the bin sizes are unequal. 

Before closing this section, we consider a different version of list scheduling: 
assign items to the bin of least load. Denote the algorithm as LS' . Consider 
the following simple instance L for two bins. Let bi = 2 b 2 and let £ > 0 be 
a sufficiently small number. The first two items have size of 62 — £, which are 
followed by an item with size of s/2. By algorithm LS' , after assigning the first 
two items, each bin has a load of 62 — £• If the 3rd item goes to the first bin B\, an 
item (the last one) with size of 62 comes, which is assigned to f? 2 - If the 3rd item 
goes to the second bin f? 2 , then the 4tlr item has a size of e and the 5tlr item (the 
last one) has size of 62 - The 4th is assigned to Bi while the 5tlr is assigned to B 2 . 
In both cases, LS' l { 2, B) = b\ + b 2 + b 2 — s. In an optimal packing, we can assign 
the last item to B 2 and the others to B\. Then OPTl( 2,B) = bi + b 2 , which 
shows that Rls>(2,B) — > 4/3 as £ tends to zero. It implies that Rls’ > 4/3. 



3 The Two- and Three-Bin Cases 

Assume that b\ > b 2 > ■ ■ ■ > b m > p m ax ■ In this section we consider the cases 
that to = 2 and to = 3. 

Algorithm A m (a ): Assign the incoming item a-, to the lowest indexed bin, if 
the total excess of the bins is not greater than a; otherwise assign a,; to the bin 
of largest idle space. 

In this algorithm a is a user-specified parameter. Choosing different param- 
eters results in different algorithms. 

Lemma 3.1. For any parameter a > 0, the competitive ratio of algorithm A 2 (a) 
is at least 1 + b 2 /(3(bi + b 2 )). 
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Proof. Let N be a sufficiently large integer. If a < 1)2/3, consider an instance 
L as follows. The first [_(6i — 2£>2 / 3 ) A^J items are small items, each with size 
1 /iV, which are followed by items 01,02,03 with size of 62/3, 2& 2 / 3 and 2 b 2 / 3 , 
respectively. Clearly, A 2 (a) assigns all the small items together with ai to bin 
B 1, and a 2 to bin B 2 . Thus no matter where 03 is assigned, A 2 (a)i( 2 , B) > 
61 + 62 + 62/3 — 1 /N. For an optimal solution, we can assign ai and 02 to B 2 , 
and the remaining items to B\. Thus OPTl( 2 ,B) = 61 + b 2 , which follows that 
A 2 (a)L( 2 ,B)/OPT L ( 2 1 B) 1 + 62/(3(61 + 6 2 )) as N goes to infinity. 

If a > 62/3, consider the following instance L: the first [(6i — 26 2 / 3 )iVJ 
items with size 1 /iV, are followed by item 01 with size b 2 . So A 2 (o.)l{ 2 ,B) > 
6i + 6 2 / 3 + 6 2 — 1 /N. In an optimal packing, we assign ai to B 2 , and the remaining 
items to B\. OPTl( 2 ,B) = b\ + b 2 , which follows A 2 (a) l( 2 , B) / OPTl( 2 , B) — > 
1 + 6 2 / ( 3 ( 6 i + 6 2 )) as N tends to infinity. □ 

By setting a = b 2 /3 , we prove that the competitive ratio of A 2 (a) is 1 + 
62/(3(61 + 62)). 

Theorem 3 . 2 . The competitive ratio of A 2 (a) is 1 + 6 2 /( 3 ( 6 i+ 6 2 )) if a = 6 2 / 3 . 
Moreover the overall competitive ratio is 7 / 6 . 

Proof. If both bins are light or both are heavy, the packing is optimal. We only 
need to consider the case that exactly one bin is heavy. From the assumption ( 2 ), 
we have Mi + M 2 > 61 + 6 2 , where M t is the load of bin B t , i = 1 , 2 . 

Let a = 6 2 / 3 . Let T be the excess of the heavy bin and let X be the idle 
space of the light bin. If T < a, then R A ^ a ^( 2 ,B) < (T + b± + b 2 )/(b\ + b 2 ) < 
1 + 62/(3(61 + 62)). Now assume that T > a. We want to prove X < a. 

Case 1 . B 2 is light. X = b 2 — M 2 . Let a n be the last item assigned to B 1, 
which makes the excess over a. B 2 ^ 0 and before a n is assigned B\ is light 
and has an idle space at least X. Otherwise, a n should have been assigned 
to B 2 . It implies that the size of the items in B 2 is larger than X + a. Thus 
X = 62 — M 2 < 62 — (X + a). It follows that X < a. 

Case 2 . B\ is light. X = 61 — M\. Before the last item is assigned to B 2 , 
B 2 is light and has an idle space at least X. The total size of items in B 2 
before the last item is assigned is at most b 2 — X. According to the algorithm, 
b 2 — X + Mi > 61 + a. It implies that X < a. 

In both cases we have R A 2 ^( 2 ,B) < (X + Mi + M 2 )/(Mi + M 2 ) < 1 + 
62/(3(61 + 62)). It is easy to verify that R^ 4 2 ( a )(2,-) = 7 / 6 . □ 

Since no on-line algorithm can have an overall competitive ratio less than 
7/6 for two bins [ 5 ], A 2 (a ) is an optimal on-line algorithm for m = 2 in terms 
of the overall competitive ratio, setting a = b 2 / 3 . 

We will show a lower bound for two bins under the assumption ( 1 ). Let 
61 = kb 2 /3 + x, where 0 < x < b 2 /3 and k > 3 (k is an integer). 

Lemma 3 . 3 . No on-line algorithm can achieve a competitive ratio smaller than 

f 1 + (62/3 - x)/{bi + 6 2 ), if 0 < x < 62/6; 

— \ 1 + x/(bi + 62), if 62/6 < x < 62/3. 
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Proof. Let A be any on-line algorithm. Consider the following instance L. The 
items with size of 62/3 come one by one until 71(62) = 2 or n(&i) = k — 1, where 
n(bi) is the number of items with size of 62/3 placed into bin Bi by algorithm A 
for i = 1,2. 

Case 1. 71(62) = 0 ,n( 6 i) = k — 1 . If 0 < 2; < 62/6, the next two items have size 
of 262/3. Thus Al( 2 ,B) > 61 + 62 + 62/3 — x. 

If 6 2 /6 < x < 62/3, the next item has size of x. If x goes to B 3 , two items 
with size of 262/3 come. It follows that Al( 2 ,B) > 61 + 62 + 62/3. If x goes to 
B 2l an item with size of 62 comes. Then Al( 2 , B) > 61 + 62 + x. 

Case 2. 77(62) = 1, 77(61 ) = k — 1. The next item has size of 62 and then 

Al { 2,yB) > 61 + 62 + 62/3. 

Case 3. 71(62) = 2, ?r(6i) = p, where 0 < p < k — 1. 

Subcase 3 a) lip = k— 1 , the next item has size of 262/3 + 2:, and thus Al( 2 , B) > 
61 + 62 + 62/3. 

Subcase 3 b) If p = k — 2 , we consider two subcases. If 0 < x < 6 2 /6, the next 
item has size of x. If it is assigned to B\, an item with size of 62 comes. Then 
Al( 2 ,B) > 61 + 6 2 + 62/3. If it is assigned to B 2 , an item with size of b 2 comes. 
So Al{ 2 ,B) > 61 + 62 + 62/3 — x. If 62/6 < x < 62/3, the next two items have 
size of 62/3 + x and 2 b 2 / 3 . So, Al( 2 ,B ) >61 + 62 + 2:. 

Subcase 3 c) If p = k — 3 , the next two items have size of 2 b 2 /3 + x/ 2 . Al( 2 ,B) > 
61+62 + 62/3. 

Subcase 3 d) lip < k — 4 , let s = \(k—p)/ 3 ]-l. The next s items have size of b 2 . If 
one of such items is packed into B 2 , then Al( 2 ,B) > b\ + 62 + 2 b 2 / 3 . Otherwise, 
all the s items go to B\. The idle space of B\ becomes (k — p)b 2 /3 + x — (|~(fc — 
p)/ 3 ] — 1)62, which is either 6 2 /3 + x or 2 b 2 /3 + x or 6 2 + x. Then it is reduced 
to the above three subcases 3 a) - 3 c). 

Note that in any of the above cases, OPT L ( 2 , B) = bi + b 2 . Thus, the lemma 
is proved. □ 

Lemma 3.3 shows that Ra( 2 , B) > 1 + 6+f++) ^ or an ^ on “li ne algorithm A. It 
also shows that algorithm A 2 (a) is optimal for the case that x = 0 (or 61 = kb - 2 /3 
for some integer k ), by setting a = b 2 / 3 . Now we consider the tlrree-bin case. 

Lemma 3.4. For any parameter a > 0, the competitive ratio of algorithm ^ 3 ( 0 ) 
is at least 1 + 63/(2(61 + b 2 + 63)). 

Proof. Let TV be a sufficiently large integer. If a > 63/2, consider the following 
instance L. The first [6iTV — lj items with size of 1 /iV are followed by an item of 
size 63/2 + l/TV. Clearly, A 3 (q:)l(3, B) > 61 + 62 + 363/2 — 1 /TV and OPTl{ 3 ,B) < 

61 + 62 + 63 + 1 /TV. It follows that A 3 (a) l( 3 , B) /OPT L ( 3 , B) -+ 1 + 63/(2(61 + 

62 + 63)) as TV tends to infinity. 

If a < 63/2, consider the following instance L. [6iTV — lj items with size of 
1 /TV are followed by item ai with size of a + l/TV. Then [(62 — 63/2)^ +1 items 
with size of 1 /TV are coming, which are followed by the last two items a 2 , a 3 with 
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size of 63/2, 63— a— 1 /N, respectively. Algorithm A 3 (a) assigns the first [biN— lj 
items together with a\ to B\, [(^2 — b 3 / 2 )N + lj items with size 1 /N to B2, and 
02 and 03 to B 3 . It shows A 3 (a)i,( 3 ,B) > 61 + 62 + 363/2 — 2 /N. In an optimal 
packing, we can assign a\ and a 3 to B 3 , the first [6iiV — lj items with size 1 /N to 
Bi and the remaining items to B 2 . It follows that OPTl{ 3 , B) < bi+b 2 +b 3 +l/N. 
Then A 3 (a)i( 3 , B)/OPTl( 3 , B) -a 1 + 63/(2(61 + 62 + 63)), as N tends to infinity. 

□ 

In the following, we prove that the competitive ratio of algorithm A3 (a) can 
reach the lower bound 1 + 63/(2(61 + 62 + 63)) by setting a = 63/2. 

Theorem 3 . 5 . The competitive ratio of the algorithm A3 (a) is 1 + 63/(2(61 + 
62 + 63)) ifa = 63/2. 

Proof. Let a = 6 3 / 2 . In the packing given by A 3 (a), let Mi be the total size of 
the items assigned to bin Bi. If all the bins are heavy or all the bins are light, 
the algorithm gives an optimal packing. Hence, we only need to consider the 
following cases. 

Case 1 . Only one bin is heavy. If the excess of the heavy bin is at most a, we 
can get the competitive ratio 1 + 63/(2(61 + 62 + 63)) immediately. Before the 
last item of the heavy bin is assigned, all the three bins are light and the heavy 
bin has the largest idle space at the moment. 

Subcase la) Bi is heavy. Then M\ > 61 + a. Note that M 2 + M3 >62 + 0. We 
have 

Ra 3 (c)( 3 ,B) < (Mi + 62 + 63)/ (Mi + M 2 + M 3 ) 

= 1 + (62 + 63 — M2 — M3) / (Mi + M2 + M3) 

< 1 + 63/ (2(6i + 62 + 63)) 

Subcase lb) B 2 is heavy. M 2 > 6 2 + 63/2 and Mi + M3 >61 + 63/2. 

Ra 3 (c)( 3 ,B) < (M 2 + 61 + 63 )/ (Mi + M 2 + M 3 ) 

= 1 + (61 + 63 — Mi — M3) / (Mi + M 2 + M3) 

< 1 + 63 / (2 (Mi + M 2 + M 3 )) 

< 1 + 63/ (2(6i + 62 + 63)) 

Subcase lc) B 3 is heavy. M 3 > 63 + 63/2. Let X be the total idle space of Mi 
and M 2 . Then X = 61 + b 2 — (Mi + M 2 ). Let Y 3 be the load in B 3 before the 
last item is assigned. Thus 



b 3 — Y 3 > 62 — M 2 , 63 — Y 3 > 61 — Mi . 

We have X = b\ + 62 — (Mi + M 2 ) < 2(63 — Y 3 ). On the other hand, 



Y 3 + Mi > 61 + 63/2, Y3 + M 2 > 62 + 63/2. 
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We have X = b 1 + b 2 -{M 1 + M 2 ) < 2 Y 3 -b 3 . So, X < min{2(6 3 - Y 3 ), 2Y 3 -6 3 } < 
6 3 /2. Therefore, 

Ra 3 (c)( 3, B) < (X + M x + M 2 + M 3 )/(M 1 + M 2 + M 3 ) 

< 1 + 6 3 /(2(6i +b 2 + b 3 )). 

Case 2. Only one bin is light. If the idle space X of the light bin is at most a or 
the total excess of the two heavy bins is at most a, we have the competitive ratio 
immediately. We only need to consider the case that X > a and the total excess 
exceeds a. However, We will show that it is impossible. Before considering the 
following two cases, we assume that M, t > 0 for i = 1,2,3. Otherwise, the total 
excess is at most a. 

Subcase 2a) B 3 is light. M 3 = b 3 — X < a = b 3 / 2. Let Y t be the load of Bj 
immediately before it becomes heavy, for i = 1,2. Then bj — Y t > X\ otherwise 
the last item, which makes the total excess of B i and B 2 over a, would have 
been assigned to bin B 3 . Thus all the items in B 3 should have been assigned to 
Bi by the algorithm. It is a contradiction. 

Subcase 2b) B 3 is heavy. Recall that any item size is at most b 3 . There are at 
least two items in B 3 . Assume that a*, is the last item assigned to B 3 . Those 
items assigned to B 3 have size at least X > b 3 / 2. Thus the idle space of B 3 
before a*, is assigned is less than X , which means that ak would have not been 
assigned to B 3 . It is a contradiction. □ 

From Theorems 3.2 and 3.5 we realize that the competitive ratio of algorithm 
A m (a) for unequal bins is better than that for equal bins. 



4 Without the Assumption on Item Sizes 

In this section, we will discuss the problem without the assumption (1) for the 
two-bin case. 

Lemma 4.1. Without the assumption (1), no deterministic on-line algorithms 
can achieve an overall competitive ratio lower than 6/5 for two bins. 

Proof. Let b\ = 3/2 and b 2 = 1. The first item a\ has size of 1/2. If a\ is assigned 
to bin B 2 , two items a 2 , a 3 with size of 1 are coming. If cq is assigned to bin B 3 , 
item a 2 with size of 3/2 is coming. For both cases, the overall competitive ratio 
is at least 6/5. □ 

Lemma 4.2. For any on-line algorithm A the competitive ratio 

( 1 + (46 2 /3 - bi)/(bi + b 2 ), if b\ < 7b 2 /6, 

Ra{ 2, B) > \ 1 + (&i — b 2 )/(bi + b 2 ) , if 7b 2 /6 < b 3 < 46 2 /3, 

y 1 + b 2 /(3(bi + b 2 )), if b\ > 46 2 /3. 

Proof. Let A be any on-line algorithm. Consider first the case that bi > 46 2 /3 
with the following instance L. The first two items have size of b 2 / 3. If both 
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items go to B i, the next item has size of b\ ; if both items go to B 2 , two items 
with size of 262/3 and 61 — 62/3, respectively, are coming; if the first two items 
go to different bins, the next item has size of 61. For any of the above cases, 
A L (2,B) > 61 + 46 2 /3 and the optimal value OPT L (2,B ) = 61 + b 2 . It implies 
that the lemma is true. 

Now turn to the case that b\ < 462/3. Let 61 = 62 + x, where 0 < x < b 2 /3. 
The lemma follows from Lemma 3.3 by setting k = 3. □ 

We consider on-line algorithms. Assume that all items have size of at most 
61 (61 > 62). 



Theorem 4.3. The competitive ratio of the list scheduling algorithm LS for two 
bins is 1 + min{62, 6i/2}/(6i + b 2 ). 

Proof. We first show the upper bound. Without loss of generality, we consider 
the case only one bin is heavy. 

Case 1. 6i/2 > b 2 . Since the LS algorithm assigns items to the bin with largest 
idle space. Recall that the total size of items is not less than the total size of 
bins, we get that the idle space is at most b 2 . Then, LSl{ 2,,B) < Eft + b 2 , 
OPT l ( 2,B) > J2Pi >h+ 62, which implies R LS {2.,B) < 1 + 6 2 /(6 i + b 2 ). 

Case 2. 6i/2 < b 2 . Let X be the idle space and T be the excess. We only 
consider the case that both X and T are greater than b\/2. Otherwise, it follows 
that Rls( 2) #) < 1 + 6i/(2(6i + 6 2 )). Let p n be the size of the last item assigned 
to the bin which has an excess over 6i/2. Recall that all items have size of at 
most 61. Let Yj be the load of bin Bi before the last item is assigned. Then 
bi — Yi > X > 6i/2, since p n < 61. It implies that either X or T is at most b\/2. 

The following simple instance shows that the bound is tight. Consider three 
items with sizes of min{62, 6i/2}, max{0,62 — 6i/2}, and 61, respectively. The 
optimal value is 61 + b 2 while LS costs 61 + 6 2 + min{6 2 , 6i/2}. □ 

In the following, we present an on-line algorithm to improve the upper bound 
in some cases. 

Algorithm A 2: 



— If 61 < 462 /3, apply algorithm A 2 (a) by setting a = b 2 /3; 

— if 462/3 < 61 < 262, apply A 2 (a) by setting a = 61 — 62; 

— if 61 > 262, apply algorithm LS. 

Theorem 4.4. The competitive ratio of algorithm A2 is 

1 + 62/(3(61 + 62)), if 61 < 462/3, 

1 + (61 — 6 2 )/(6 i + 62), if 462/3 < 61 < 262, 

1 + 62 / (61 + 62 ) , if 61 > 262 . 



Ras(2,B) < 
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Proof. Consider the following cases: 

Case 1. b\ < 46 2 /3. We use A 2 (a), by setting a = 6 2 /3. The proof is similar as 
the one of Theorem 3.2. 

Case 2. 4& 2 /3 < b\ < 2 b 2 . We only need to consider the case that exactly one 
bin is heavy, the excess of the heavy bin is over bi~b 2 , and the idle space of the 
light bin is larger than b\ — b 2 . However, we will show this case is impossible. 
Before the last item a n , which makes the excess over b\ — b 2 , is assigned, if B 2 
is empty, then the excess is at most b\ — b 2 after assigning a n . Assume that B 2 
is not empty before a n is assigned. Note that B i has an idle space larger than 
b\ — b 2 before a n is assigned. It means that the total size of items in B 2 is more 
than 2(6 1 -6 2 ). Then the idle space of bin B 2 is at most b 2 — 2{bi — b 2 ) < (fq — & 2 ). 
In this case a n is assigned to B\ . B 2 is a light bin and the idle space is less than 
bi — b 2 . It is a contradiction. 

Case 3. b\ > 2 b 2 . It follows from Theorem 4.3. □ 

Final Remarks. There are many open questions. Although we have proved 
that the algorithm A 2 (a) is optimal in terms of the overall competitive ratio, 
it is still open to find a best possible on-line algorithm for two bins in terms 
of the competitive ratio. It is interesting to design an algorithm with a better 
competitive ratio than LS algorithm for any number of bins. The problem with- 
out the assumption (1) becomes more difficult. We only proved the competitive 
ratio of LS for the two-bin case. Any improvement on the lower bounds is also 
of interest. 
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Abstract. We consider the range assignment problem in ad-hoc wire- 
less networks in the context of selfish agents: a network manager aims in 
assigning transmission ranges to the stations so to achieve a suitable net- 
work with a minimal overall energy, stations are not directly controlled 
by the manager and may refuse to transmit with a certain transmission 
range because this results in a power consumption proportional to that 
range. 

We investigate the existence of payment schemes which induce the sta- 
tions to cooperate with a network manager computing a range assign- 
ment, that is, truthful mechanisms for the range assignment problem. 



1 Introduction 

One of the main benefits of ad-hoc wireless networks relies in the possibility 
of communicating without any fixed infrastructure. Indeed, it just consists of a 
set of stations which communicate with each other via radio transmitters and 
receivers. Due to the limited power of the stations, multi-hop transmissions are 
in general unavoidable. That is, instead of sending the message directly from the 
source station to the sink station, the message is sent via intermediate stations. 
Another benefit of multi-hop transmissions is that they reduce the overall energy 
required by the communication. 

Assigning transmission powers to stations which (i) guarantee a “good” com- 
munication between stations, and (ii) minimize the overall power consumption 
of the network gives rise to interesting algorithmic questions. In particular, these 
two aspects yield a class of fundamental optimization problems, denoted as range 
assignment problems, that has been the subject of several works in the area of 
wireless network theory [7,5,4]. 
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Unfortunately, the inherent self- organized nature of ad-hoc networks can 
complicate these problems even more. Since there is no central authority con- 
trolling the stations, we have to assume that they will act as selfish agents. That 
is, they will always try to optimize their own benefit, without considering the 
social cost of their behavior. This is exploited in the following model. 

2 The Model 

Range assignments without selfish stations. The instance of the range assign- 
ment problem consists of a complete weighted digraph G(S , S x S) and a graph 
property n. The set S denotes the set of stations. For i,j £ S, let C* be the 
weight of the arc from station i to station j. This can be interpreted as the cost 
for sending a message from station i to station j. 

The goal is to find the cheapest subgraph G(S , E) that satisfies the graph 
property 7r. In general, we just write E to denote the subgraph. 

The cost of E, also called communication graph, differs slightly from the one 
for wired networks. Let E l C E be the set of links emanating from station i and 
let C l = { C \ , . . . , C l n } be the cost vector or them. Since station i can send a 
message to all its out-neiglrbors, provided it uses enough power to reach all of 
them, its cost depend only on the most expensive arc it has to provide. The cost 
of station i for maintaining the communication graph E, denoted by cost*, is 
thus 

cost HC\E):= max C\. 

j:(i,j)GE 3 

Hence, the total cost of the range assignment E is 

cost ({C 1 , . . . , C n }, E)=Y^ cost*(< T, E). 

ies 

Range assignment with selfish stations. We consider each station as a selfishly 
acting agent that privately knows part of the input: station i privately knows C l 
that the manager must use for the computation of a feasible solution. 

We assume that the range assignment is chosen by a network manager. In 
order to choose a low-cost solution, the manager needs information about the 
cost of the connections. So, in the first phase, every station i sends a vector 
D' = {D \, . . . , D l n ) to the network manager, where D'- denotes its declared cost 
for maintaining its link to station j. In an ideal world, we could assume that 
station i just sends the true values C l . But since we assume that the station act 
selfishly, we have to assume that they lie to the network manager by declaring 
D l yf C l . In the second phase, the range assignment is computed on the declared 
values and then implemented by informing the stations about E. 

However, in order to be successful in this model, one needs to convince the 
stations to always tell the truth. This can be achieved by a so-called mechanism. 
A mechanism for a range assignment problem is a pair (alg, P), where ALG is an 
algorithm that, on input T> = {D 1 , . . . , D n }, returns a feasible range assignment 
E = ALG(P) and a payment vector P = P(T>, E) = {P 1 , . . . , P™}, consisting of 
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a payment for each station. Hence, agents, being selfish, will try to maximize 
their utility 

U* = P\V, E) - cost^C*, E), i=l,...,n. 

A mechanism (alg,P) is called truthful if any agent i can always maximize 
her utility by declaring the truth, i.e., when D' = C l . This has to hold for all 
possible declarations of the other agents. 

3 Truthful VCG Mechanisms 

The theory of mechanism design dates back to the seminal papers by Vickrey [10], 
Clarke [3] and Groves [6]. Their celebrated VCG mechanism is still the promi- 
nent technique to derive truthful mechanisms for many problems (e.g., shortest 
path, minimum spanning tree, etc.). In particular, when applied to combinato- 
rial optimization problems (see e.g., [8,9]), the VCG mechanisms guarantee the 
truthfulness under the hypothesis that the optimization function is utilitarian, 
that is, the optimization function is equal to the sum of the single agents’ valu- 
ations, and that ALG computes always the optimum. The second condition can 
be weakened to the following [9]. 

Property I: An algorithm ALG has property I if the solution it returns is optimal 
among all the solutions in its output set. 

Let us denote by V~ 1 the set of declarations T>\D l . The payment scheme is of 
the form 

P i = cost j {D j ,E) + h l {V~ i ), 
iVi 

where h l (-) is any function independent from D l . Intuitively, these mechanisms 
achieve truthfulness by relating the utility of an agent with the total cost of the 
solution chosen by the mechanism: The better the solution the higher the utility. 
In order to formalize this statement let us consider the utility of agent i. 



U* = P i - cost* (C*,E) 

= h*{V~*) - cost i (C i ,E) - cost j (D j ,E) 

iVi 

= h*(V~*) - cost((2? _i ; C*), E). 

Since /i*(X> - *) is independent of D l , agent i tries to minimize cost((2? - *; C l ), E). 
Hence, his declaration should be chosen such that the algorithm returns a so- 
lution E which minimizes cost((2?”*; C l ), E). This can be achieved simply by 
declaring the truth, assuming that the mechanism has property I. 

Many range assignment problems are known to be Np-hard or even log APX- 
hard. We therefore cannot expect to find an algorithm that always returns the 
optimal solution. Therefore, we have to go for property I. The main problem we 
are faced with in the area of truthful mechanisms for range assignment problems 
thus is to find good approximation algorithms ALG with property I. 
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4 Previous Related Work and Open Problems 

Range assignment problems are widely studied. For a survey, the reader is re- 
ferred to [4]. A very interesting version is the Euclidean version, where the sta- 
tions are point in the Euclidean plane and the cost to send a message from i to 
j is Cj = dist(i, j) a , where dist(i,j) is the Euclidean distance between i and j 
and a > 1 is a constant. This model admits a polynomial-time 2-approximation 
algorithm [7]. 

The only case in which a mechanism is known is the case a = 1. There, 
the v/ ^ +1 -approximation algorithm by Calinescu and Zaragoza [2] can be turned 
into a mechanism [1]. Also in [1], it was observed that the approximation ratio 
of this algorithm was improved to 1.5. 

Several interesting problems are open in this area. First of all, in would be 
nice to design mechanisms for other types of range assignment problems. Fur- 
thermore, one could think of different models of privacy. For instance, one can 
assume that the protocol has a partial knowledge of the network topology. Fi- 
nally, even though the VCG method is the major technique in order to obtain 
efficient truthful rewarding mechanism, a fundamental future research is to de- 
velop alternative methods to manage selfish behavior in the context of energy 
consumption in wireless networks. 
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Wireless networks have received significant attention during the recent years. 
Especially, ad hoc wireless networks emerged due to their potential applications 
in battlefield, emergency disaster relief, etc. [13]. Unlike traditional wired net- 
works or cellular wireless networks, no wired backbone infrastructure is installed 
for ad hoc wireless networks. 

A node (or station) in these networks is equipped with an omnidirectional 
antenna which is responsible for sending and receiving signals. Communication 
is established by assigning to each station a transmitting power. In the most 
common power attenuation model [13], the signal power falls as l/r“, where r 
is the distance from the transmitter and a is a constant which depends on the 
wireless environment (typical values of a are between 1 and 6). So, a transmitter 
can send a signal to a receiver if > 7 where P s is the power of the 

transmitting signal, d(s, t) is the Euclidean distance between the transmitter 
and the receiver, and 7 is the receiver’s power threshold for signal detection 
which is usually normalized to 1. So, communication from a node s to another 
node t may be established either directly if the two nodes are close enough and s 
uses adequate transmitting power, or by using intermediate nodes. Observe that 
due to the nonlinear power attenuation, relaying the signal between intermediate 
nodes may result in smaller power consumption. 

A crucial issue in ad hoc wireless networks is to support communication pat- 
terns that are typical in traditional networks. These include broadcasting, multi- 
casting, and gossiping (all-to-all communication). Since establishing a commu- 
nication pattern strongly depends on the power levels, the important engineering 
question to be solved is to guarantee a desired communication pattern minimiz- 
ing the total power consumption. In this work, we consider a series of power 
consumption problems which we formulate below. 

Consider a complete directed graph G = ( V , E), where |U| = n, with a non- 
negative edge cost function c : E — » R + . Given a non-negative node weight 
assignment w : V — > R + , the transmission graph G w is the directed graph 
defined as follows. It has the same set of nodes as G and a directed edge (it, v ) 
belongs to G w if the weight assigned to node u is at least the cost of the edge 
(■ u,v ), i.e., w(u) > c(u,v). Intuitively, the weight assignment corresponds to the 

* This work was partially supported by the European Union under 1ST FET Project 
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power levels at which each node operates (i.e. , transmits messages) while the 
cost between two nodes indicates the minimum power required to send messages 
from one node to the other. Usually, the edge cost function is symmetric (i.e., 
c(u,v) = c(v,u)). In ideal cases, we can assume that the nodes of the graph 
are points in a Euclidean space and the cost of an edge between two nodes is 
the Euclidean distance of the corresponding points raised to a constant power. 
Asymmetric edge cost functions can be used to model medium abnormalities or 
batteries with different power levels [12]. 

The problems we study can be generally stated as follows. Given a complete 
directed graph G = (V, E), where | V| = n, with non-negative edge costs c : E — » 
f? + , find a non-negative node weight assignment w : V — > R + such that the 
transmission graph G w maintains a connectivity property and the sum of weights 
is minimized. Such a property is defined by a requirement matrix R = (r^j £ 
{0, 1} where rq is the number of directed paths required in the transmission 
graph from node Uj to node Vj. 

In Minimum Energy Multicast Tree (MEMT), the connectivity prop- 
erty is defined by a root node vq and a set of nodes D C V— {x^o} such that r;, j = 1 
if i = 0 and Vj £ D and = 0, otherwise. The Minimum Energy Broad- 
cast Tree (MEBT) is the special case of MEMT with D = V — {i>o}. Liang 
[12] presents an intuitive reduction of any instance of MEMT to an instance of 
Directed Steiner Tree (DST). An 0(|.D| e )-approximation algorithm for any 
e > 0 then follows by using an approximation algorithm for DST 1 . The authors 
show in [4] that MEMT is also at least as hard as DST (this is also observed in 
[2]). Using a recent inapproximability result for DST, we obtain an 0(ln 2 ~ e n) 
inapproximability result for MEMT. Hence, improvements on the approxima- 
bility bounds of MEMT strongly depend on new results for DST. It is strongly 
believed that poly logarithmic approximation algorithms for the latter do exist. 
In symmetric graphs, there exists an 0(hr |D ^-approximation algorithm. This 
is shown in [4] where instances of MEMT in symmetric graphs are reduced to 
instances of Node- Weighted Steiner Tree, a problem which is known to 
be approximable within a logarithmic factor. Note that this result is asymptot- 
ically optimal since it can be easily seen that MEMT in symmetric graphs is 
at least as hard to approximate as Set Cover [5]. MEBT is easier and can 
be approximated within O(lnn) as it was recently shown in [4] and [2], In [4], 
the authors show that the corresponding instance of DST has some very good 
properties and can be approximated within a logarithmic factor. Calinescu et 
al. [2] achieve a similar approximation bound by a simpler algorithm which con- 
structs a tree incrementally. Sophisticated set-covering techniques are used for 
the analysis. Constant approximation algorithms for the special case of MEBT 
where the nodes of the input graph correspond to points in the Euclidean plane 
are presented in [5,14]. The special case of MEBT where the nodes are points 
in a line can be solved in polynomial time [3,6]. 



1 Due to lack of space, references to known results for combinatorial problems which 

are crucial in the proofs of the results presented have been omitted from this survey. 
The interested reader may see the papers cited here and the references therein. 
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In Minimum Energy Steiner Subgraph (MESS), the requirement matrix 
is symmetric. Alternatively, we may define the problem by a set of nodes D C V 
partitioned into p disjoint subsets D\, £> 2 , D p . The entries of the requirement 
matrix are now defined as 'i\ :l = 1 if £ D & for some k and r,^ = 0, oth- 

erwise. The Minimum Energy Subset Strongly Connected Subgraph 
(MESSCS) is the special case of MESS with p = 1 while the Minimum Energy 
Strongly Connected Subgraph (MESCS) is the special case of MESSCS 
with D = V (i.e., the transmission graph is required to span all nodes of V and 
to be strongly connected). Although the approximability of MESS is yet un- 
known, MESSCS has the same approximability properties as MEMT, i.e., it can 
be approximated within 0(\D\ e ) but cannot be approximated within 0(ln 2-c n). 
Whether there are polylogarithmic approximations for this problem is an inter- 
esting open problem and strongly depends on new results for DST. MESCS 
can be approximated within a logarithmic factor using an algorithm for MEBT 
as a subroutine. This result is asymptotically optimal. In symmetric graphs, 
MESS, MESSCS and MESCS have constant approximation algorithms using al- 
gorithms for network design problems as a subroutine [4] (see also [11]). The 
respective bounds are a 2-approximation algorithm for MESCS using optimal 
Spanning Tree solutions (this was first proved by Kirousis et al. in [10]), a 
2p-approximation algorithm for MESSCS using p-approximate Steiner Tree 
solutions, and a 4-approximation algorithm for MESS using a 2-approximation 
algorithm for Steiner Forest. Approximation schemes are unlikely to exist for 
these problems in symmetric graphs since inapproximability bounds have been 
proved for MESCS [8] using approximation-preserving reductions from Vertex 
Cover in bounded-degree graphs. The special case of MESCS where the nodes 
are points in a line can be solved in polynomial time [7]. 

The authors of [1] study MESCS under the extra requirement that the trans- 
mission graph contains a bidirected subgraph which maintains the connectiv- 
ity requirements of MESCS. By adding this extra requirement to MESS and 
MESSCS, we obtain the bidirected MESS and bidirected MESSCS, respectively. 
In [4] , it is shown that instances of bidirected MESS can be reduced to instances 
of Node- Weighted Steiner Forest, a problem for which logarithmic approx- 
imations exist. As corollaries, we obtain 0 { In |D|)- and 0(ln ?r)-approximation al- 
gorithms for bidirected MESSCS and bidirected MESCS, respectively. A slightly 
inferior logarithmic approximation bound for bidirected MESCS is presented 
in [2] using different techniques. These results asymptotically match the inap- 
proximability result for bidirected MESCS of Althaus et al. [1]. For symmetric 
graphs, the positive and negative results for MESS, MESSCS, and MESCS hold 
for their bidirected versions as well. Better approximation algorithms for bidi- 
rected MESCS are presented in [1]. 

We have surveyed results on power consumption problems where the connec- 
tivity requirements are defined by 0-1 requirement matrices. A natural extension 
is to consider matrices with non-negative integer entries rij denoting that at least 
rij node-disjoint paths are required from node Vi to node Vj . This extension leads 
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to interesting power consumption problems for fault torelant wireless communi- 
cation. Some recent results on such problems can be found in [9, 11]. 
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Abstract. This work is motivated by the need for approximation al- 
gorithms for the integral multicommodity flow problem which arise in 
numerous optimization scenarios, including the design of telecommuni- 
cation networks. We improve on one of the most efficient known com- 
binatorial approximation algorithm for fractional multicommodity flow 
by using an incremental approach. This approach is validated by exper- 
imental results, which show a significant speed-up. 

Keywords: Multicommodity flow, combinatorial approximation, dy- 

namic shortest paths, incremental algorithm. 



The multicommodity flow problem is useful in numerous applications, especially 
when the issue is to compute paths for entities that are concurrent for some 
resources. In these cases, the resources are modeled by capacities on the edges 
of the graph bearing all possible routings. 

If the multicommodity flow is constrained to be integer, the problem is AfV- 
hard. Nevertheless, Raghavan [Rag94] proposed a randomized algorithm con- 
structing a multicommodity flow based on the randomized rounding of the linear 
relaxation of the problem, namely the fractional multicommodity flow. Recently 
an improvement of this algorithm has been proposed [CR02]. This algorithm 
computes a better approximation of the integer multicommodity flow, but at 
the cost of solving a fractional multicommodity flow for each commodity in an 
iterative process. Solving the fractional multicommodity flow efficiently is there- 
fore a crucial issue. 

Given that the fractional multicommodity flows is used to compute an ap- 
proximated integral solution, there is no real need for the use of optimal frac- 
tional flows. Indeed, the approximation yielded by the randomized rounding 
process is an additive gap of order the square root of the fractional capacities 
[Rag94,CR02]. If these fractional capacities are (1 + e)-approximations of the 
optimal capacities, the magnitude of the final gap will not change dramatically. 
It is therefore natural to exploit this freedom in order to consequently speed up 
the integer multicommodity flow approximation algorithm. A first article [GK98] 
proposes a combinatorial algorithm computing a fractional multicommodity flow 
(l + e)-approximation in time 0(e^ 2 km 2 n 2 ), where k is the number of commodi- 
ties, n and m are the number of vertices and edges of the flow network. It is 
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then possible to avoid the linear program solvers, much more costly to run. This 
algorithm has been improved by Fleischer [FleOO], decreasing the complexity 
to 0(e _2 m 2 log ?r). Both algorithms are mainly based on sequences of shortest 
path computations. In a previous paper [BCL + 03], we gave an improvement on 
Fleischer’s algorithm using dynamic shortest paths algorithms. 

In the following we propose an incremental strategy to improve on the run- 
ning time of these algorithms. We then investigate on our approach through 
practical tests on different flow networks. We first present the algorithm that we 
improve. 



1 Combinatorial Multicommodity Flow Approximation 

The (1 + e)-approximation algorithm for fractional multicommodity flow 
[BCL + 03] is based on a combinatorial understanding of the dual of the mul- 
ticommodity flow edge-path linear program. This dual builds a length function 

1 on the edges of the graph such that the length of an edge is related to the 
amount of flow it bears. 

The process starts by assigning to each edge the same initial length <5 > 0, 
a constant which depends on the parameters of the algorithm, including the 
approximation factor e, and corresponds to a null amount of flow. Then, the 
algorithm pushes flow iteratively along single source shortest paths (sssp) for 
each commodity while the length of each edge of the paths is increased. The 
loop ends when every shortest paths from the source of a commodity to its 
destination are of length more than 1. At this step, the analysis shows that the 
flow scaled by logi+ t (^4^) fits the capacities, and that the worst case complexity 

is O ^ n _|_ nlogn)^ . One may notice that this complexity is the same as 
the one given by Fleischer [FleOO] , even though the use of dynamic shortest paths 
algorithms yields an improvement of the running time. As a matter of fact, the 
speed up provided by such algorithms cannot be precisely evaluated unless the 
structure of the flow network is known. 

In the following, we try to cope with the e -2 factor in the complexity. 

2 An Incremental Approach 

In order to ensure a (1 + e)-approximation, each iteration of the previous algo- 
rithm pushes a very small amount of flow. We propose an incremental approach 
based on the idea that the precision is only required at the end of the pro- 
cess. Therefore, our algorithm, reported as Algorithm 1, starts by running the 
previous algorithm with a large eo, which pushes a lot of flow quickly. Instead 
of scaling the flow to fit the capacities, Algorithm 1 scales it with the length 
function so that it looks like an intermediary step of previous algorithm with a 
smaller ei = cq/2. Starting from this intermediary step, the process in finished 
with ei, scaled again for e 2 = ei/2 and so forth, until Algorithm 1 reaches the 
required e/. 




258 



David Coudert, Herve Rivano, and Xavier Roche 



Algorithm 1 Incremental multicommodity flow (1 + e)-approximation 

Require: G = (V,E), \ V\ = n, \E\ = m, C = {(si, U), i = l...k}CV 2 ,e>0 
Ensure: l length function over E s.t. every path Si — > ti has length > 1. 

Ensure: / (1 + e)-approximation of max. multicommodity flow for C on G 
1: {Initializing} t = 4, e(t) = 2 _t , 5(t) = e' (t) (e £(i) n) _ TF, B(t) = log e o(^|), 
Ve € E, 1(e) = S(t), fi(e) = 0 
2: while e(f) > eo do 

3: {Lower bound on the length of a SSSP } A = min e6 e 1(e) 

4: while A < 1 do 

5: for all i = 1 . . . k do 

6: P ^SSSP ( Si — * U) 

7: while l(P) < e^\ do 

8: Cm <— min e gp c(e) 

9: Ve € P, fi(e) <— fi(e) + c m , 1(e) <— l(e)e e(f) TT 

10: P ^Usssp (si — > U) 

11: A <— Ae e(t - ) 

12: tf-t+1 

13: Ve € E, 1(e) <- l(e)-^^e <t) ~ri ( ~r : ^y~ 2)B(t ~ 1) 

14: Ve 6 E, Vi = 1 . . . fc, fi(e) = ^y/i(e) 

15: Vi= l...k,Ve€£, fi(e) = ^ 



3 Experimental Results 

The analysis of the complexity and approximation performances of Algorithm 
1 is a work in progress. Nevertheless, experimental results are encouraging. We 
report on a set of experiments achieved on variable size multicommodity flow 
problems, arising in the setting of routing and wavelength assignment for wave- 
length division multiplexing optical networks [CR02], The results are depicted 
as Figure 1. 




Fig. 1. Running times function of network size (left) or quantity of flow (right). 
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The left hand side of Figure 1 presents the running time of the optimal 
resolution of the flow with CPLEX, the algorithm of [BCL+03] with e varying 
from 1/4 to 1/16, and Algorithm 1 (named CRR) with eo = 1/4 or 1/8 and 
ef = 1/64 or 1/128. One can see that both combinatorial algorithms are much 
faster than CPLEX when the size of the problem increases. These results also 
show that the running time of Algorithm 1 for a given eo equals to the running 
time of Algorithm [BCL+03] for e = e 0 plus something small and linear in ej 1 . 

The right hand side presents the time required by the different combinatorial 
algorithms to produce a given approximation of the maximum multicommodity 
flow. These experiments have been achieved on the network of size 500, but 
the same results are observed with any size. As far as Algorithm 1 for eo = e/ 
is exactly Algorithm [BCL+03] for the same parameter, we obtain this comb 
graph like results. One can then realize that Algorithm 1 is able to produce 
approximated solution much faster than Algorithm [BCL+03]. Moreover, the 
influence of eo on the quality of the solutions of Algorithm 1 is weak, but very 
strong on the running time. 

4 Conclusion 

In this paper, we have proposed a combinatorial incremental (1 + e)-approx- 
imation algorithm for the fractional multicommodity flow problem, improving 
on several algorithms of the literature. Practical experiments yield some con- 
jectures on its complexity and approximation performances. A formal analysis 
of the algorithm is an ongoing work, as well as its integration in an integral 
multicommodity flow approximation algorithm and further improvements. 
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In general, we define a disk graph (DG) as the intersection graph of a set of disks 
in the Euclidean plane. Recently, there has been increasing interest in studying 
the class of DGs. This primary motivated by its applications which can be found 
in radio networks, map labeling, and in sensor networks, just to name a few. 
From another side, DGs have a very simple geometric structure. This motivates 
the study of theoretical problems. Here we give a short survey on DGs. We briefly 
discuss coloring, independent set and clique. We include hardness results, main 
ideas used in approximation and online algorithms, lower and upper bounds. We 
also mention some open questions. 

Disk Graphs. There are a number of variants for a DG to be represented by 
a set of disks. If disks are not allowed to overlap but touch, we have a coin 
graph (CG). If all disk have unit diameter, we have a unit disk graph (UDG). 
If the diameter ratio of disks is bounded by some constant a , we have a cr-disk 
graph (er-DG). For an illustration see Fig. 1. As pointed in [5], UDGs need not 



Unit Disk Graph 




Coin Graph 




2-Graph 




Fig. 1 . Disk graphs 



to be perfect since any odd cycle of length five or grater is a UDG. Similarly, 
UDGs need not be planar; in particular, any clique of size five or more is a UDG. 
From another side, every planar graph is a CG [15]. Thus many of the known 
efficient algorithms for perfect and planar graphs do not apply to (unit, er) DG. 
Unfortunately, the recognition problem of a (unit, cr) DG is also NP-hard [4, 12]. 
So, we always need to distinguish whether an algorithm uses disks as the input 
or not. 
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Independent Set. An independent set (IS) of a graph G = (V, E) is a vertex 
subset V' C V such that for any pair of vertexes in V' , there is no edge between 
them in E. The independence number a(G) is defined as the size of a maximal 
independent set (MIS) in G. 

It is NP-hard to find an MIS in a UDG, even if its disks are given [5,21]. 
However, the neighborhood structure of a unit disk in the plane gives two nice 
ideas, see Fig 2. First, there are at most 5 non-intersecting unit disks around. 
Second, 5 reduces to 3, if the unit disk has the lowest y-coordinate amoung all 
disks. Accordingly, we can use GREEDY: Take disks one by one, put a disk into 





Fig. 2. The neighborhood structure 



an IS if it does not intersect any disk in the IS. (Of course, GREEDY not need 
to use disks.) In the worst case, any disk in the IS blocks at most 5 other disks. 
So, the size of IS is at least a{G )/ 5 [10,16]. Thus GREEDY is a 5-competitive 
algorithm. From Fig 2, no deterministic online algorithm can be better than 
5-competitive. 

In the offline case, the approximation ratio of GREEDY can be improved 
from 5 to 3 [17]. We work in the same way but take disks in the order of non- 
decreasing y-coordinates. Here we also not need to use unit disks. We can always 
find a vertex whose neighborhood does not contain an IS of size larger than 3. 
(A simple enumeration completes in 0(|P| 5 ) time.) We add such a vertex to an 
IS, remove all its neighbors, and repeat the procedure until the input graph is 
empty. So, either disks are used or not, there is a 3-approximation algorithm, 
but the running time differs. 

Regarding disk graphs, we just follow old ideas. In the online case, we use 
GREEDY. Either disks are given or not (some simple arguments), GREEDY is 
an optimal (|Vj — l)-competitive algorithm [6]. In the offline case, we can first 
order disks by non-decreasing size, and then use GREEDY on disks in this order. 
So, we get a 5-approximation algorithm. As before, we also not need to use disks. 
It suffices to identify a vertex whose neighborhood does not contain an IS of size 
at most 6. This also gives a 5-approximation algorithm [17]. 

Finally, given a set of (unit) disks in the plane, we can find an IS of size at 
least (1 — e)a(G), for any fixed e > 0 [14,18,8]. In other words, there exists a 
polynomial time approximation scheme, or a PTAS for short. To be able to cope 
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with the problem, one uses the ideas of the shifting technique [2, 13]. Due to the 
space limitations we omit it. 

Clique. A clique of a graph G = (V, E ) is a vertex subset V' C V such that 
such that for any pair of vertexes in V', there is an edge between them in E. 
The clique number tu(G) is defined as the size of a maximal clique (MC) in G. 

Surprisingly, it turns out that an MC in an UDG can be found in polynomial 
time. Provided unit disks in the plane, the main idea is rather simple [5] . Consider 
two unit disks x and y with centers p x and p y , respectively. Obviously, they 
intersect iff the distance d(p x ,p y ) < 1. Then, let A x>y be the set of all unit disks 
whose centers are in both (closed) circles around p x and p y with radius d(p x ,p y ), 
see Fig 3. Certainly every disk in A xy must intersect both x and y. A xy not 
need to induce a clique, but it must induce complements of a bipartite graph. 
This can be seen by drawing the line between p x and p y . The disks in either 
half intersect pairwise. An MC in complements of a bipartite graph, i.e. an MIS 
in a bipartite graph, can be found in time 0(\V\ 2 ' 5 ), by matching. Any clique 
must be the form of A x y . So, we can find an MC by looking at all A x , y . The 






running time of the algorithm is 0(|U| 4 ' 5 ), but improves to 0(|F| 3 5 ) [3]. Later, 
in [20], these nice ideas were adopted so as to remove the dependence on unit 
disks. First, a greedy algorithm is used to find a cobipartite neighborhood edge 
elimination ordering. That is, an ordering ei, e 2 , . . . , e\E\ such that for each edge 
ei = ( x , y) the set of vertices iVj CV which are adjacent to both x and y in the 
subgraph induced by {e^, ej+i, . . . , e\E\}i induces a cobipartite subgraph in G. 
As one can see, if G is an UDG then such an ordering exists. Again, by matching 
we can find MCs for all N{. An MC of G must be among them. The resulted 
algorithm is also a robust algorithm. Given a graph, it either finds an MC or 
outputs that the given graph is not an UDG. 

Unfortunately, there are no results for general DGs. The complexity of the 
problem is also open. 

Coloring. A coloring of a graph G = (V, E) is an assignment of colors (posi- 
tive integers) to each vertex in V such that no edge in E connects two identi- 
cally colored vertices. The chromatic number x(G) is the defined as the minimal 
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number of colors in a coloring of G. Clearly, we need at least w(G) colors, i.e. 
w(G) < x(G). 

It is NP-complete to decide whether a UDG can be colored with 3 colors, 
even if its disks are given [5]. (So, there is no 4/3-approximation algorithm 
unless P = NP.) Furthermore, deciding fc-colorability remains NP-complete, for 
any fixed k [10]. 

For a UDG, we simply recall the neighborhood structure of a unit disk v 
in V. Clearly, there are at most 5(u>(G) — 1) neighbors of v. So, we can use 
First-Fit: Take vertices one by one, and assign a least available color to the 
vertex. Then, First-Fit uses at most 5c u(G) + 4 colors [6,7]. This gives a 5- 
competitive algorithm. Regarding a lower bound, no algorithm can be better 
than 2-competitive [9]. Later, this bound was improved to 4 [22]. 

Indeed, we can improve the performance of First-Fit in the offline case by 
processing the disks in the plane in the order of non-increasing y-coordinates. 
Similarly, we can show that First-Fit uses at most 3cu(G) — 2 colors [19]. This 
gives a 3-approximation algorithm. We can also adopt the algorithm to the 
case when no geometric information about disks is known. We fist recursively 
order the vertices by non-increasing degree, and then, apply First-Fit by tracking 
vertices back. Again, such the smallest-degree-last First-Fit algorithm uses at 
most 3 u)(G) — 2 colors [10]. So, it achieves approximation ratio 3. 

Interestingly, the smallest-degree-last First-Fit also performs well on general 
DGs. Its approximation ratio can be proven to be 5 [16]. Furthermore, by ex- 
ploring the neighborhood structure of a disk in the plane one can color any DG 
with at most 6tu(G) — 6 colors. 

In the online case, no constant competitive algorithm can be found. As it 
was shown in [7, 11], i?(log | V|) is a lower bound on the competitive ratio of any 
online algorithm, even if it uses disks as the input. From another side, First-Fit 
is log |U|-competitive [6], that gives an optimal online algorithm. 
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Abstract. Energy has emerged as a critical constraint for a large num- 
ber of portable, wireless devices. For data intensive applications, a signif- 
icant amount of energy is dissipated in the memory. Advanced memory 
architectures support multiple power states of memory banks, which can 
be exploited to reduce energy dissipation in the system. We present a 
general methodology using combinatorial graph scheduling techniques, 
which can be used for obtaining efficient memory power management 
schedules for algorithms. Additional techniques like tiling further im- 
prove the efficiency of our approach. Our simulation results show that 
we can obtain over 98% energy reduction in the memory energy for the 
Transitive Closure using our methodology. 
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1 Introduction 

Due to the growing popularity of embedded systems and related [1-3], energy 
has emerged as a new optimization metric for system design. As the power 
availability in most of these systems is limited by the battery power of the 
device, it is critical to reduce energy dissipation in these systems to maximize 
their operation cycle. 

The topic of energy reduction has been intensively studied in the literature 
and is being investigated at all levels of system abstraction, from the physical 
layout to software design. The research at the architecture level has led to inno- 
vations such as the dynamic voltage and frequency scaling of the processor [8] 
and in design of power efficient memory systems. The latter provides support 
for dynamic transition of memory banks into various power states and partial 
activation of rows or columns in memory banks [4]. In order to design energy 
efficient applications it is required that the architectural power management fea- 
tures are exploited at higher levels. Till now, most researchers have focused upon 
the compiler [5-7] and the operation system level, where the target architecture 
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is taken into account, and the optimization techniques are specific to the cho- 
sen application. Some contributions have also been proposed at the algorithmic 
level. The goal is to optimize an algorithm for energy dissipation in the memory 
and other devices [9- 11]. The solutions proposed are again specific to the chosen 
algorithms. 

In this paper, we provide a combinatorial method to schedule memory power 
state in an efficient way. 



2 A Model of Energy Evaluation 

We assume that the energy spent for running an algorithm mainly depends on : 

— the operations performed by the processor (arithmetic and logical operations, 
comparisons, etc...); 

— the operations performed on the memory (read/write operations, storage, 
and state switchings); 

— the operations performed between the processor and the memory (transfers 
between the global memory and the cache). 

Given an algorithm running on an entry of size n, if T(n), S(n ), and C(n) 
denote respectively the time complexity (number of processors operations), the 
space complexity (the amount of memory used), and cache complexity (number 
of cache missing), the energy complexity of the algorithm is approximated by 

E(n) = rT(n) + QQ T S(n) + f2W T C{n). (1) 



3 Our Methodology 

In this section, we describe our methodology to obtain efficient power state 
schedules for the memory system. 



3.1 Timing Function 

Given a tasks graph G = ( X , D ), we present a method to assign each node a valid 
execution time. We assume that each task takes one unit time (a time cycle). 
First, we consider an ordinal correspondence ?y : X — > (1, • • • , |X|} (thus a total 
order in X). Next, we associate to each node x G X a value 'k{x) initialized with 
its in-degree given by 

|{sGX:(a,a:)G£>}|. (2) 

For any subset W C X, the unique node x such that 7r(x) = 0 and tj(x) is minimal 
is denoted by y(W). Hence, the following algorithm provides an execution time 
of each nodes in X. 
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Algorithm 1 . Algorithm for tasks execution times 

for x £ X 

n(x) <— |{s € X : (s,x) £ D}| 
endfor 
S-^X 
T <— |5| 
for t <— 1 to T 
x £- x(S) 
n(x) <— t 
for y £ D(x) 
n(y) £- 7 v(y) - 1 
endfor 
S <r- S - {X} 
endfor 



3.2 Memory Power State Management 

Consider the previous graph model G = (A, D) and a given timing function 
7 r. For a given node x £ X, since each node of D(x) is executed at a dif- 
ferent time, we can write the set D(x) in an execution time ordered form 
{x\, X 2 , ■ ■ ■ , £ c (»}) where c(x) = |-D(a;)| (the out-degree of x). Next, we con- 
sider the set {<5i(a;), 62 ( 2 ), ■ ■ ■ , <5 c (a:)-i}> where 

Sj(x) = 7r(xj +1 ) - Tr(xj). (3) 

Now, consider that the set of memory state is S = {0, 1, • • • , m}, where 0 stands 
for the active state , and corresponding activation cost and storage cost are W = 
(w 0 , W\,W 2 , ■ ■ ■ , Wm) (w 0 = 0) and Q = (go, Qi, ■ ■ • , q m ) respectively. We assume 
that w s < w s+ i, s = 0, ...,m — 1. We emphasize on the fact that 0 stand for 
active state and m stand for inactive state. 

For a given j £ {1, ■ ■ ■ , c(x) — 1}, we consider 

a(j) = min{s, 0 < s <m : Sj(x)q s > w s }. (4) 

Since wo = 0, a(j) is well defined for each j, and it indicates the best memory 
state for x in the time interval between the computation of Xj and that of Xj+ 1 . 

4 Simulation Results 

Lastly, we present our simulation results. The simulations were performed using 
the simulator described in [9]. Table 1 shows our results for Transitive Closure, 
where a gain of 98% was obtained. 

5 Conclusion 



We have proposed a combinatorial method to schedule tasks graph with an em- 
phasize on energy dissipation. Although the method is applied on a graph model, 
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Table 1. Simulation Results with the Transitive Closure 



n 


T 


T 


T—T 

T 


E 


E 


E-E 

E 


32 


0.012 


0.12 


0.012 


0.052 


0.0006 


0.987 


64 


0.090 


0.092 


0.014 


0.399 


0.0056 


0.985 


128 


0.723 


0.738 


0.020 


3.143 


0.0562 


0.982 


256 


5.830 


5.954 


0.021 


25.308 


0.4644 


0.981 



it can be also used with recurrence equations specifications through appropri- 
ate syntaxical manipulations. Simulation results show that the method yields 
significant energy reduction. We think that the study of a dynamic version of 
the method would be interesting. Tiling issue should be analyzed in more de- 
tails, since it is the key for successful optimization through data/instructions 
remapping. 
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